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Abstract 

The oft-neglected issue of tlie causal structure in the flat spacetime 
approach to Einstein's theory of gravity is considered. Consistency 
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requires that the flat metric's null cone be respected, but this does 
not happen automatically. After reviewing the history of this prob- 
lem, we introduce a generalized eigenvector formalism to give a kine- 
matic description of the relation between the two null cones, based 
on the Segre classification of symmetric rank 2 tensors with respect 
to a Lorentzian metric. Then we propose a method to enforce special 
relativistic causality by using the naive gauge freedom to restrict the 
configuration space suitably. A set of new variables just covers this 
smaller configuration space and respects the flat metric's null cone au- 
tomatically. In this smaller space, gauge transformations do not form 
a group, but only a groupoid. Respecting the flat metric's null cone 
ensures that the spacetime is globally hyperbolic, indicating that the 
Hawking black hole information loss paradox does not arise. 



1 Introduction 



A number of authors have discussed the utility of a flat background met- 
ric r/^^ in general relativity or the possibility of deriving that theory, ap- 
proximately or exactly, from a flat spacetime theory |l-71, 73|-8ll, 8^, 85- 



126 , 12£ , 131 , 132 1 . Some have permitted the background metric to be curved 



[75, 91,103-105,133-138], but our interest is in flat backgrounds only, be- 
cause they are uniquely plausible as nondynamical entities. The use of a 
background metric enables one to formulate a gravitational stress-energy 



tensor [102|, not merely a pseudotensor, so gravitational energy and mo- 
mentum are localized in a coordinate-independent (but gauge- variant) way. 
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It also enables one to derive general relativity and other generally covari- 
ant theories from plausible special-relativistic postulates, rather than pos- 
tulating them [0-|8|, |5|^, |0|^, |6|, |3|-0, H, ^M^^ ^M^' M,^,^- 
g7ipnH9lP MO0| , P^ , [l^P^M2l ,|T26|. It is worth recalling a conclusion 
of E. R. Huggins |^9|, who was a student of Feynman. Huggins found that 
the requirement that energy be a spin-two field coupled to the stress-energy 
tensor does not lead to a unique theory, because of superpotential-type 
terms. Rather, "an additional restriction is necessary. For Feynman this 
restriction was that the equations of motion be obtained from an action 
principle; Einstein required that the gravitational field have a geometrical 
interpretation. Feynman showed these two restrictions to be equivalent." 
[^9| (p. 3) Because other derivations have built in the requirement of an ac- 
tion principle already, it is no surprise that Riemannian geometrical theories 
are the unique result. As W. Thirring observed, it is not clear a priori why 
Riemannian geometry is to be preferred over all the other sorts of geometry 
that exist, so a derivation of effective Riemannian geometry is attractive 

Casting gravitation in the same form as the other forces provides another 
reason to consider a field approach: the flat metric's null cone provides a 
causal structure for defining dynamics in quantum gravity, which otherwise 
is lacking. The lack of an a priori fixed causal structure is merely technically 
demanding at the classical level, but it constitutes a real puzzle at the quan- 
tum level, for one no longer knows how to write equal-time commutation 
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relations, for example, because one needs to know the metric in order to de- 
termine equal times, but the metric is itself quantized: a chicken-and-the-egg 
problem. 

Such derivations of general relativity and related theories, however, are 
perhaps only formally special relativistic, because the curved null cone might 
not respect the flat one. This difficulty afflicts not merely our derivation 



1 121 1, but in fact all derivations in this tradition, and implies that the alleged 
resemblance of Einstein's theory to other field theories in this approach is 
merely formal, for all that has been shown to date. We survey in some detail 
the treatment of this fundamental question over the last six decades. As it 
happens, this issue has in general been ignored, explained away, postponed 
with the hope that it would go away, or mishandled, although there have 
been positive signs in recent years. We critique claims that the problem is 
insoluble and claims that it has already been solved, and conclude that the 
issue remains quite open. 

Next we undertake to solve the problem. The kinematic issue of the 
relationship between the two null cones is handled using the work of G. 
S. Hall and collaborators on the Segre classification of symmetric rank 2 
tensors with respect to a Lorentzian metric. For our purposes, we classify the 
curved metric with respect to the flat one, and find necessary and sufficient 
conditions for a suitable relationship. Requiring that flat spacetime causality 
not be violated, and not be arbitrarily close to being violated, a condition 
that we call "stable ry-causality" , implies that all suitable curved metrics have 
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a complete set of generalized eigenvectors with respect to the flat metric, 
and that the causality conditions take the form of strict rather than loose 
inequalities. Given strict inequalities, one is in a position to solve such 
conditions, which are somewhat analogous to the "positivity conditions" of 
canonical gravity, which have been discussed by J. Klauder, F. Klotz, and 
J. Goldberg. In these new variables, stable ry-causality holds identically, 
because the configuration space has been reduced (though the dimension is 
unchanged), largely by reducing the lapse until the proper null cone relation 
holds. This reduction implies the need for reconsidering the gauge freedom of 
the theory. It turns out that gauge transformations no longer form a group, 
because multiplication is not defined between some elements. But they do 
form a groupoid, which seems quite satisfactory. Given the satisfactory 
outcome of the effort to make the proper null cone relationship hold, the 
above-mentioned derivations of general relativity as an ostensibly special 
relativistic theory in fact succeed. The naive gauge freedom turns out to 
include some unphysical states, but that is not a serious problem. 

Making the curved metric respect the flat null cone ensures that the re- 
sulting spacetime is globally hyperbolic. This fact is quite consistent with the 
existence of a region of no escape, which can arise due to the inward tilting 



of the curved null cones [107|. Given that global hyperbolicity apparently 
pulls the fangs from the Hawking black hole information loss paradox, it 
appears that this paradox does not afflict the special relativistic approach 
to Einstein's theory of gravitation. This lack of paradox indicates that the 
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flat metric can be employed such that it is not merely a formal mathematical 
trick, but rather has beneficial physical consequences. 



2 Bimetric General Relativity and Null Cone Con- 
sistency: A History Since 1939 

As we have seen, the use of a flat metric tensor 77^,^ in gravitation has 
received a fair amount of attention over the last six decades or so. However, 
the interpretation of the resulting bimetric or field formulation of general 
relativity has not been adequately clarified, due to an ambiguous notion of 
causality: the effective curved metric which determines matter propagation 
is not obviously consistent with the flat background causal structure. Having 
a consistent relationship is clearly a necessary condition for a true special- 
relativistic theory. 

Whether it is sufficient is unclear from anything said so far, because the 
propagation of gravity itself or of nonminimally coupled matter fields can 



yield more subtle behavior |15£, 160 , |163| . However, we will see below that 
this null cone condition indeed is sufficient due to the presence of a well 
posed initial value formulation. 

We now sketch the history of the flat null cone issue from roughly the 
late 1930s till the present. We do not consider the period between 1905 and 
the late 1930s, though that might be an interesting project, which would 
consider the time after special relativity had solidified and include the in- 
vention of Einstein's theory of gravitation. (L. P. Grishchuk mentions a bit 



of the history in works of Poincare and Einstein |94]. Fang and Fronsdal 
sketch the history of the flat spacetime approach up to 1979 [36|, but neglect 
to consider the null cone issue.) Rather, we start with the rebirth of the flat 
spacetime approach to gravity, with works by Fierz, Pauli, and Rosen. While 
the importance of the problem perhaps seems evident in retrospect, the ne- 
glect of it in the literature suggests that it in practice was not so obvious, or 
that influential radical empiricist philosophies obscured it. One can roughly 
divide the issue's history into three periods, though at times we will disre- 
gard the historical boundaries to be able to discuss an author's whole work 
in a unified way. For the first 20 years (1939-1959), the problem seems not to 
have been recognized or mentioned in print (to our knowledge). For the next 
two decades (1959-1979), it was sometimes mentioned, but either resolved 
incorrectly, dismissed as unimportant, or postponed with the hope that it 
would disappear. More recently (1979-2001), it has been recognized more 
often, and occasionally regarded as worthy of sustained attention. A few 
authors have attempted either to solve it or to prove it insoluble. However, 
we disagree that either of these goals has been achieved, and will undertake 
to show why. One also finds authors continuing to ignore this problem even 



in the last few years ^6, 123 1, and even within a review article [117]. When a 
fundamental issue is either ignored or mishandled for a long time, a critical 
history of the subject becomes necessary, so we provide it here. 

One should perhaps distinguish between two null cone problems. The 
first is: given that one regards Einstein's equations as describing the evo- 
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lution of an effective curved metric in Minkowski spacetime, what does one 
make of the potential violation of Minkowski causality by matter responding 
to the curved metric? The second is: given that one chooses to quantize the 
geometrical (single-metric) theory, what does one make of causality without 
a metric to define equal-time commutation relations? However, these prob- 
lems are related, and we believe that the SRA as presented here solves them 
both, so we will treat them together. 

2.1 The Years 1939 to 1959: the Null Cone Consistency 
Problem Ignored 

Around 1940, in his seminal papers on the bimetric description of general 
relativity, N. Rosen suggested that there ought to be some (gauge-fixing) 
relation between the flat and curved metrics, because one expects that the 
two coincide if the gravitational field vanishes |Q (p. 149). While this 
paper did not consider the meaning of the bimetric formalism in detail, 
its companion paper (p. 150) considered interpretive issues. Rosen wrote 
(apart from a change in notation to match ours), 

[f]rom the standpoint of the general theory of relativity, one must 
look upon rj^i, as a fiction introduced for mathematical conve- 
nience. However, the question arises whether it may not be pos- 
sible to adopt a different point of view, one in which the metric 
tensor r]^u is given a real physical significance as describing the 
geometrical properties of space, which is therefore taken to be 
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flat, whereas the tensor g^j^ is to be regarded as describing the 
gravitational field. ||3| (p. 150). 

Rosen recognized that the flat spacetime view implies that the speed of light 
measured with ideal rods and clocks undistorted by gravity will differ from 
unity (p. 153), but he seems not to have addressed the possibility that 
it might exceed 1. While his approach merely postulated bimetric general 
relativity, he did suggest that it would be desirable to derive it independently 

(p. 153). His intention to carry out this procedure himself |^ (p. 153) 
seems not to have been realized, but many others have done it since that 
time, as we have seen. 

During the 1940s, with some war-time inconvenience in Greece, A. Pa- 
papetrou was able to express general relativity in an attractive form resem- 
bling electromagnetism, with the theories being expressed in the tensorial 
DeDonder and Lorentz gauges, respectively Q. He emphasized the im- 
proved nature of the conservation laws, especially for angular momentum, 
and found that certain attractive relations that have no invariant meaning in 
the geometrical view become perspicuous given the flat spacetime interpre- 
tation. Papapetrou held that for the flat spacetime approach, gauge-fixing 
to tie together the two metrics was "indispensable" (p. 20), because the 
energy-momentum and angular momentum localization would suggest phys- 
ically distinct systems given different relations between the two metrics. He 
was aware of Rosen's result that the flat spacetime interpretation implies a 
varying speed of light (using unrenormalized instruments), but seems also 
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to have failed to entertain the possibihty that the gravitational field might 
make light travel faster than in special relativity. 

The neglect of the null cone issue continued well into the 1950s in the 
important works of S. N. Gupta p"^-[l^ and R. H. Kraichnan [jl^ Q . At this 
stage the derivation of the exact nonlinearities of general relativity, which 
Rosen had desired, was achieved. Concerning the special-relativistic nature 
of the theory, both authors seem to have regarded the Lorentz covariance of 
the theory as sufficient for special relativity. If the theory's gauge invariance 
and the unobservability of the flat metric are mentioned, the idea that the 
observable effective curved metric might well conflict with the flat metric is 
not. This is an important distinction that will also be overlooked repeatedly 
by later authors. One could imagine that the flat metric might fail to appear 
in the equations of motion, but still have its null cone serve as a bound on 
the curved metric's null cone, so this distinction is crucial. The flat metric 
might have important qualitative consequences, even without having any 
quantitative role in the field equations. 

F. J. Belinfante, interested in the work of Papapetrou and Gupta and in 
particular in the solidifying covariant perturbation approach to quantizing 
gravity, contemplated the use of a flat metric in "Einstein's curved universe" , 



which evidently meant the geometrical theory of gravity [19|. Working in the 
context of the static Schwarzschild solution (in which it is difficult to get the 
relation between the two null cones wrong, at least outside the Schwarzschild 
radius, unless one tries to do so), Belinfante only had occasion to consider the 
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null cone relationship incidentally. But the fact that r becomes g'-temporal 
and t becomes g'-spatial for very small radii, juxtaposed with the a priori 
fixed character of these quantities with respect to the flat metric, does give 
him pause. Belinfante gives indications (including in the paper's title) that 
he does not believe deeply in the flat spacetime approach, so perhaps the null 
cone issue would not have interested him. While he is prepared to suggest 
that the "Swiss-cheese" -like behavior of the Schwarzschild solution in the 
bimetric context might help eliminate field theory's divergences, it is clear 



from review papers on quantum gravity |22, 2^ that the flat metric is just 
a tool-perhaps a useful one, but more likely not-for Belinfante. It is thus 
not too surprising that the null cone issue is ignored. The "[r]eal problem" 
is not to be found in "[t]heories, usually in flat space, which seek to be ap- 
proximations to Einstein's theory, or a perturbation-theoretical treatment 
of Einstein's theory", but in "[q]uantization of Einstein's theory itself." (pp. 
198, 192) For Belinfante, spacetime might have a Swiss cheese struc- 

ture, contain worm holes, or have a closed spatial topology [^]. Some of 
Belinfante's work with Swihart on linear gravity also neglects to discuss the 



null cone issue |20|,|21|. 



2.2 The Years 1959-1979: the Problem Dismissed or Post- 
poned 

The null cone consistency issue is perhaps first discussed in print by W. 
Thirring in 1959 |25- p^] , but then dismissed with a resolution that does not 
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permit a true special relativistic interpretation. Thirring clearly recognizes 
the apparent conflict between the two null cones |26|, writing, "Another fea- 
ture of the equations of motion ... we want to point out is that the velocity 
\d'x./dt\ is not required to be < 1 . . .Thus [assuming the curved metric to 
be diagonal] there is a limiting velocity c but it is space dependent and may 
exceed unity." (pp. 100, 101) A bit later, he writes "Since c is also gauge 
dependent and will exceed unity in some gauge systems [the matter equation 
of motion] even admits an apparently acausal behavior." (p. 101) However, 
Thirring thinks that this acausal behavior is only apparent, for he is satisfied 
with the fact that the "renormalized" velocity (measured physically using 
real clocks, which are distorted by the gravitational field) is not greater than 
unity: "However, we shall see shortly that c also corresponds to the velocity 
of light and that it becomes unity when measured with real measuring rods 
and clocks since they all are affected by the [gravitational] field." (p. 101) 
Evidently the unobservable nature of the intervals governed by rj^jj satisfies 
Thirring that the apparently acausal behavior is not a problem: "The real 
metric [interval corresponding to g^u\ is gauge invariant whereas [the inter- 
val corresponding to r]^y\ is not and therefore has no physical significance. 
Space-time measured with real objects will show a Riemannian structure 
whereas there are no measuring rods which could measure the original pseu- 
doeuclidean space." (p. 103) Thirring's argument is doubtful because the 
same distinction that was neglected by Gupta and Kraichnan is also ne- 
glected here: the non-measurability of the flat metric does not entail that 
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it lacks physical significance. Generally one considers causality to be an 
important physical concept. At the risk of stating the obvious, we recall 
that in special relativity, the relevant speed for causality is not the speed at 
which electromagnetic radiation actually propagates, but the value of the 
universal velocity constant (ordinary called "the speed of light" and written 
as c, but to do so here would invite confusion) which appears in Lorentz 
transformations, that is essential. As is well-known, to permit propagation 
faster than that speed in one frame is to admit backward causation-which is 
usually rejected-in another frame. Given the violation of the flat spacetime 
null cone, it is not clear what Thirring's field theoretic approach means. Yet, 
according to Thirring, the field theoretic approach gives "a theory following 
the pattern of well understood field theories, in particular electrodynam- 
ics." (p. 116) Thus, Thirring's list of advantages and disadvantages of the 
field and geometric approaches to gravity (pp. 116, 117) is notably incom- 
plete, because the obvious notion of causality for the field approach has been 
discarded. Thirring comes very close to noticing the problem of null cone 
inconsistency, but then stops short, apparently due to a prejudice against 
unobservable entities. 

One might hope that Thirring's almost-recognition of the problem would 
have inspired his successors to recognize and perhaps try to solve it. That, 
however, did not occur. In particular, although L. Halpern made a rather 
minute study of Thirring's paper |3^, the light cone issue receives only a sin- 
gle sentence (p. 388), one sufficiently noncommittal that no discomfort with 
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Thirring's purported resolution of the causality issue is obvious. Halpern 
was not an advocate of the flat spacetime approach to gravitation |3^, so 
it is the more remarkable that he overlooked a potentially serious difficulty. 
R. Sexl also was aware of the Thirring's work and even presented it at a 
conference |30|, yet he also accepted Thirring's ostensible resolution of the 



null cone conflict |3^, 31 1. 



The covariant perturbation program for quantizing general relativity 
yielded a large number of works based on expanding the curved metric into 
a background part and a dynamical part. Commonly the background metric 
was flat, leading to equations at least formally special relativistic. Thus, one 
might expect the question of the relation of the two metrics to be considered 
in some way. In fact, one finds rather less attention being given to this key 
conceptual issue than one would expect. 

A notable exception to the use of flat backgrounds is the work of B. S. 
DeWitt, who made great use of non-flat background metrics and found vari- 



ous benefits in doing so [ 134 1 . While DeWitt could make use of a background 
metric, to him it was always at most a tool, not a deep part of nature. In 
an article entitled "The Quantization of Geometry," he wrote: 

The problem of [quantizing the gravitational field] may be ap- 
proached from either of two viewpoints, loosely described as the 
"flat space-time approach" and "the geometrical approach." In 
the flat spacetime approach, which has been investigated by sev- 
eral authors . . . the gravitational field is regarded as just one 
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of several known physical fields, describable within the Lorentz- 
invariant framework of a flat space-time. Its couplings with other 
fields . . . lead to a contraction or elongation of "rigid" rods and 
a retardation or advancement of "standard" clocks .... Both 
the geometrical and flat space-time points of view have the same 
real physical content. However, it has been argued that the flat 
space-time approach provides more immediate access to the con- 
cepts of conventional quantum field theory and allows the tech- 
niques of the latter to be directly applied to gravitation. While 
there is merit in this argument, too strong an insistence upon it 
would constitute a failure to have learned the lessons which spe- 
cial relativity itself has already taught. Just as it is now univer- 
sally recognized as inconvenient (although possible) to regard the 
Lorentz-Fitzgerald contraction from relativistic modifications in 
the force law between atoms, so it will almost certainly prove in- 
convenient at some stage to approach space-time geometry, even 
in the quantum domain, in terms of fluctuations of standard in- 
tervals which are the same for all physical devices and hence 
unobservable. Q (pp. 267,268). 

Concerning the question of a well-defined causal structure, which his ap- 
proach appeared to lack, he later suggested, "Critics of the program to 
quantize gravity frequency [sic] ask, 'What can this mean?' A good answer 
to this question does not yet exist. However, there are some indications 
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where the answer may he." [134]. DeWitt's vision for the program, which 
he was prepared to cah "covariant quantum geometrodynamics" in a volume 
honoring J. A. Wheeler (the title itself suggesting sympathy for a geomet- 
rical view of gravitation, much as "The Quantization of Geometry" did), 
included that it "should be able to handle any topology which may be im- 
posed on 3-space" |135|| (p. 437). Of the covariant perturbation formalism, 
he wrote that the "most serious present defect of the covariant formalism is 
its foundation in scattering theory, with spacetime being assumed asymptot- 
ically flat. The method of the background field, which we have introduced, 
indicates a way in which this defect may be removed" [|135|| (p. 437). It 
seems very likely that the null cone issue, as we have formulated it, would 
not be important to DeWitt, given that the background metric was merely 
a tool for investigating a truly geometrical theory. 

Other authors, especially in the particle physics tradition, seem at least 
somewhat more content with a flat background metric. In his lectures on 
gravitation, R. Feynman shows himself ambivalent about the interpretation 
of gravitation. After deriving Einstein's equations from a flat spacetime 
field theory, he concluded from the unobservability of the flat metric that 
the latter was not essential. Using an analogy with curiously intelligent 
insects walking on a tiled floor, he says that "[tjhere is no need to think 
of processes as occurring in a space which is truly Euclidean, since there is 
nothing physical which can ever be measured in this flctional space. The 
tiles represent simply a labelling of coordinates, and any other labelling 
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would have done just as well" ||4^ (p. 101). Concerning the "assumption 
that space is truly flat," he concludes that "[i]t may be convenient in order 
to write a theory in the beginning to assume that measurements are made 
in a space that is in principle Galilean, but after we get through predicting 
real effects, we see that the Galilean space has no significance" (p. 112), but 
serves only as a "bookkeeping device" (p. 113). Concerning the "relations 
between different approaches to gravity theory," 

[i]t is one of the peculiar aspects of the theory of gravitation, that 
it has both a field interpretation and a geometrical interpreta- 
tion . . . these are truly two aspects of the same theory . . . the fact 
is that a spin-two field has this geometrical interpretation; this 
is not something readily explainable-it is just marvelous. The 
geometric interpretation is not really necessary or essential to 
physics. It might be that the whole coincidence might be under- 
stood as representing some kind of gauge invariance. It might 
be that the relationship between these two points of view about 
gravity might be transparent after we discuss a third point of 
view .... (p. 113) 

Feynman seems to feel free to switch between the two views as he sees 
fit. Questions about nontrivial topologies or the desire to have a trans- 
parent notion of causality seem not to have occupied him. Had they, he 
might have hesitated in proclaiming them to be "the same theory", given 
the competition between causality and gauge invariance. Later, in devel- 
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oping the covariant perturbation theory, Feynman did not address these 
issues, but wrote as if no conceptual difficulties existed. He wrote: "The 
questions about making a 'quantum theory of geometry' or other concep- 
tual questions are all evaded by considering the gravitational field as just 
a spin-2 field nonlinearly coupled to matter and itself (one way, for exam- 
ple, is expanding g^^ = 5f^u + h^y and considering hfj_y as the field variable) 
and attempting to quantize this by following the prescription of quantum 
field theory, as one expects to do with any other field. The central diffi- 
culty springs from the fact that the Lagrangian is invariant under a gauge 
group," but this issue, he finds, can be resolved by adding a gauge-fixing 
term, the result being "completely satisfactory" at the level of tree diagrams 
(which correspond to the classical theory) [Q; see also (I^H). If the main 
difficulty is gauge invariance-which in fact competes with special relativistic 
causality, as Thirring nearly realized-and if the gauge-fixing terms lead to 
a "completely satisfactory" result without regard to the light cone relation- 
ship, then, unless we are to charge Feynman with oversight, clearly the flat 
metric is merely a useful tool for him. However, the claim to have avoided 
all conceptual questions cannot be sustained, because the light cone issue 
is just such a question, and the meaning of parts of Lorentz-covariant field 
theory remains obscure if the problem is ignored. Huggins, a student of 
Feynman, also neglects to consider the null cones issue [^]. 

S. Mandelstam presented a critique of the flat-space covariant pertur- 
bation program as Gupta had developed it |3^. Gupta had imposed the 
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DeDonder coordinate (gauge) condition. Let us see how close Mandelstam 
comes to identifying the null cones issue. He writes: "Quantization in flat 
space can only be regarded as a provisional solution of the problem for sev- 
eral reasons," such as its approximate (at least at that stage of development) 
character, the use of an indefinite metric, and the presence of unphysical 
states. "But the main objection to this method of quantization lies surely 
in the physical sacrifices it makes by going to flat space. The variable spec- 
ifying the coordinates are numbers without physical significance which can 
be chosen in an infinite variety of ways. On the other hand, distances in 
space-time, which are physically significant entities, are related to the co- 
ordinates in a manner which has not been elucidated when the metric is 
quantized." However, perhaps these objections can be met: "It may be 
possible to add to the theory a prescription for interpreting its results phys- 
ically. If it could then be shown that the predictions of the theory were 
independent of the coordinate conditions used, and that they tended to the 
predictions of the unquantized theory in the classical limit, we would have a 
satisfactory theory. Some progress has actually been made in this direction 
by Thirring" , which "indicates the connection of the Gupta variables to the 
metric," though "the basic difficulties of the 'flat space' approach remain." 
Clearly one of Mandclstam's worries is the question of gauge invariance in 
a procedure that makes use of coordinate conditions. It is difficult to tease 
out a clear statement of worry about rival null cones from these remarks, 
though the issue might have been intended among the "the basic difficulties 
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of the 'flat space' approach" that remain. 

A moment of clarity occurred in 1962 with the appearance of a paper 
by J. R. Klauder |38|, whose abstract opens with the statement, "[i]n any 



quantum theory, in which the metric tensor of Einstein's gravitational theory 
is also quantized, it becomes meaningless to ask for an initial space-like 
surface on which to specify the conventional field commutators." Klauder 
elaborates: 

In so far as [certain] formalisms [for quantizing gravity] are tran- 
scriptions of techniques successful in a flat Lorentz space-time, 
they ignore a unique problem peculiar to general relativity. Con- 
ventional field theories deal, in particular, with commutation 
rules, which, when employed for the fields separated by a space- 
like interval, have an especially simple form. Whether two nearby 
points are or are not space-like is a metric question that can be 
asked (and in principle answered) not only in flat space but also 
in any space with a preassigned curved metric as well. However 
as soon as the space-time metric g^u{x) becomes a dynamical 
variable-as in Einstein's theory-then an initial space-like sur- 
face on which to specify commutators of any two fields becomes 
a meaningless concept. 

Klauder's approach to handling this problem was to propose an alternative 
formalism in which fields can fail to commute at most only at the same 
event-a radical move in the opposite direction. Unfortunately, Klauder's 
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acute awareness of the null cone issue did not spread widely. 

S. Weinberg did considerable work on gravitation considered as a Lorentz- 
invariant theory [^^53|. Concerning the geometric interpretation of general 
relativity, Weinberg could write that "the geometric interpretation of the 
theory of gravitation has dwindled to a mere analogy, which lingers in our 
language . . . but is not otherwise very useful. The important thing is to be 
able to make predictions about images on photographic plates, frequencies 
of spectral lines, and so on, and it simply doesn't matter whether we ascribe 
these predictions to the physical effect of gravitational fields or to a curva- 
ture of space and time." |^2| (p. 147) This ambivalence about the meaning 
of the theory perhaps helps to explain why the null cone consistency issue 
appears to be ignored in Weinberg's writings. However, the meaning of 
concepts used in Lorentz-invariant field theory in which Weinberg's work is 
rooted, or at least its relation to an underlying classical theory, does seem 
somewhat obscure if this issue is neglected. Somewhat more recently, R. 
Penrose reported that Weinberg was "no longer convinced that the anti- 
geometrical viewpoint is necessarily the most fruitful" on account of 
some impossibility theorems In recent personal communication with 

one of us (J. B. P.), Weinberg stated that he is no longer a strong advo- 
cate of any view on the subject, though it is quite interesting that the flat 
spacetime approach reproduces general relativity. 

Based on the "spin limitation principle," which requires that only definite 
angular momenta be exchanged, V. I. Ogievetsky and I. V. Polubarinov have 
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derived Einstein's equations and a family of massive relatives thereof in flat 



spacetime [SS-G^]- While this principle is quite attractive, it fails to pay 
any heed to whether the resulting theories yield propagation consistent with 
the causal structure of the flat metric. Given that some of their theories 
are massive and thus make the r/^jy observable, this shortcoming seems fairly 
serious. While we can find no mention of the null cone issue in the work 
of Ogievetksy and Polubarinov, it would be interesting to see if the spin 
limitation principle could be generalized in such a way as to yield consistency 
of the null cones. 

A large amount of work related to the field approach has been done 
by S. Deser, sometimes with collaborators such as D. G. Boulware, R. 
Nepomechie, A. Waldron or others. In the course of papers which derived 



general relativity via self-interaction in flat spacetime |68] or curved |75], 



or general relativity from quantum gravity [^,71], or supergravity from 
self-interaction or which study bimetric theories for a festschrift for N. 



Rosen |78|, we can find no mention of the issue of the null cone consistency 
issue. In particular, Deser finds the main issues for bimetric theories to 
be essentially the same problems that he and Boulware found in massive 



variants of general relativity |162], viz., empirically falsified light bending 
properties, negative energy disasters, or both |7^. Unlike some authors who 
have a strong preference, Deser (after a rather pro-geometrical paper early 



on |164]) seems to admire both the geometric and field formulations: "The 
beautiful geometrical significance of general relativity is complemented by 
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its alternate formulation as the unique consistent self-coupled theory aris- 
ing from flat-space free gravitons, without appeal to general covariance." 
J75[ However, depending on how one reads the flat spacetime approach, one 
might obtain some different features, as we will observe below, so one might 
prefer to see the meaning of the field formalism addressed. In fact the closely 
related issue of the topology of spacetime does receive some attention ||6^ . 
Clearly Deser is not interested in taking the flat background with the utmost 
seriousness, for he states that solutions to Einstein's equations not connected 
to Minkowski spacetime ought to be considered. He even suggests that one 
need not allow for nontrivial topologies by hand, because possibly they arise 
automatically. The question of respecting the conformal structure of the flat 
background does not arise. Perhaps the closest one finds to discussion of 
the conformal structure is in the work by Deser and R. Nepomechie on the 
anomalous propagation of gauge fields in some conformally flat spacetimes. 



compared to a flat background |76, 77 1 with the same null cone structure. 



In particular, backscattering off the geometry causes the propagation to lie 
not merely on the null cone, but inside it. However, "while our results are 



surprising, they do not imply any consistency problems" |76|, as they would 
if the propagation were outside the null cone.Q 

Some time ago it was recognized by G. Velo and D. Zwanziger that spin 
I field propagation has causality worries. They found that the "main lesson 
to be drawn from our analysis is that special relativity is not automatically 
satisfied by writing equations that transform covariantly. In addition, the 



^We thank Prof. Deser for calling our attention to this issue and the spin | field. 
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solutions must not propagate faster than light" | ]161| . A rather similar lesson 
needs to be learned regarding the gravitational field (spin 2) as well, for this 
is just the point that Gupta and Kraichnan missed. 

During the 1970s, the relation between the two null cones continued to be 
neglected in the context of the covariant perturbation approach to quantizing 
general relativity, at least in practice. However, quantum gravity review 
talks drew attention to this problem from time to time. This service was 
performed with special clarity in a 1973 review by A. Ashtekar and R. Geroch 
|14C| . They find that much of the difficulty in quantizing the theory arises 
from the fact that "the distinction between the arena and the phenomenon, 
characteristic of other physical theories, is simply not available in general 
relativity: the metric plays both roles." [|140| (p. 1214) In discussing field 
theoretic approaches, they write that 

[i]t is normally the case in quantum field theory . . . that two dis- 
tinct fields come into play-a kinematical background field (the 
metric of Minkowski space) and a dynamical field .... One can 
certainly regard general relativity as a field theory, but in this 
case there is only a single field, the metric gab of spacetime, which 
must play both these roles. But the application of the techniques 
of quantum field theory apparently requires a non-dynamical 
background field. In quantum electrodynamics, for example, the 
causality of the Feynman propagators and the asymptotic states, 
in terms of which the 5-matrix is defined, refer directly to the 
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metric of Minkowski space. Thus, one does not expect to be able 
to carry over directly to general relativity, regarded as a classical 
field theory, the procedure which led for example from classical 
Maxwell theory to quantum electrodynamics. In order to apply 
the techniques of quantum field theory one must, apparently, ei- 
ther modify these techniques or reformulate the interpretation 
of general relativity as a field theory, (p. 1229) 

This latter suggestion of reinterpreting general relativity does not strike us 
as being unimaginable or even excessively difficult, especially given that 



Kraichnan had presented a simple and clean derivation already in 1955 |17]. 
However, Ashtekar and Geroch do present some objections to this general 
line of attack. "It turns out, however, that this perturbation approach to 
obtaining a quantum theory of the gravitational field suffers from a number 



of difficulties. There exist, [141| for example, four-dimensional manifolds M 
on which there are metrics gab of Lorentz signature, but on which there are 
no flat metrics." (p. 1232) However, it is not clear why such examples must 
be regarded as physically admissible. If it could be shown that some exact 
solutions of obvious physical utility admit no flat metric, then the argument 
would be persuasive, but that argument was not made. Thus, it seems that 
this argument against the perturbation approach will be highly persuasive 
only if one is already committed to a geometrical view of Einstein's equations 
at the classical level. But one would exaggerate only slightly to say that 
this is the point at issue. The idea of requiring that the curved null cone 
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be consistent with the flat one seems not to have been entertained, but 
Ashtekar and Geroch have shown powerfully, if reluctantly, why such an 
approach merits consideration. 

The null cone consistency issue was emphasized by C. J. Isham at the 
first Oxford quantum gravity symposium |177| . We quote from pp. 20, 21: 

One natural approach perhaps is to separate out the Minkowski 
metric r]^u and write g^^ix) = -q^y + h^y{x) where h^y{x) de- 
scribes the deviation of the geometry from flatness . . . [which 
approach has some advantages.] However, there are a number 
of objections to this point of view. For example: (i) The ac- 
tual background manifold may not be remotely Minkowskian in 
either its topological or metrical properties, in which case the 
separation [above] ... is completely inappropriate, (ii) Even if 
[the equation above] is justified (from the point of view of i)) 
[sic] the procedure is still dubious because the lightcone struc- 
ture of the physical spacetime is different from that of Minkowski 
space. For example, if the field (j) has some sort of microcausality 
property with respect to the metric gfj,y then this is not equiv- 
alent to microcausality with respect to the fictitious Minkowski 
background. 

Once again a prior commitment to a geometrical view of general relativity is 
manifest. Isham seems not to entertain the idea of requiring that the space- 
time be compatible with the flat background, but at least the consistency 
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issue is clearly stated. At the second Oxford symposium, Isham observed 
that "one of the ambitions of the Riemannian programme is to free quantum 
gravity from perturbation theory based on the expansion g^^iy = g^^^^+^/Gh^y. 
Expansions of this type are known to be bad in classical general relativity 
and they clearly misrepresent the global topological and lightcone structures 
of the pair {M^g^^y^ [|178| (p. 14). We suggest that no misrepresentation 
occurs if one chooses {M,g^y) suitably. 

Isham has continued to mention this issue of null cone troubles in more 



recent talks in the context of the problem of causality and time [152, 189]. 
The problem of time shows up in the light cone issue for the covariant 
perturbation approach to quantum gravity, but related difficulties show up 
elsewhere [ [189| . Still more recently, Isham has expressed the issue as follows: 

The problem of time The background metric r] provides a fixed 
causal structure with the associated family of Lorentzian iner- 
tial frames. Thus, at this level, there is no problem of time. The 
causal structure also allows a notion of microcausality, thereby 
permitting a conventional type of relativistic quantum field the- 
ory to be applied to the field hap. 

However, many people object strongly to an expansion [of the 
curved metric into a flat one plus a dynamical part] since it is 
unclear how this background causal structure is to be related 
to the physical one; or, indeed, what the latter really means 
... it is not clear what happens to the microcausal commutativity 
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conditions in such circumstances; or, indeed, what is meant in 
general by 'causahty' and 'time' in a system whose hght cones 
are themselves the subject of quantum fluctuations. |152(| (p. 58) 



While these are interesting questions, there seems to be no reason to think 
them unanswerable; indeed we shall substantially answer some of them. 

Another moment of awareness of the null cone consistency issue in a 
quantum gravity review talk comes from P. van Nieuwenhuizen at the first 
Marcel Grossmann meeting. After showing keen awareness of the problem, 
van Nieuwenhuizen shelves it. He writes: 

According to the particle physics approach, gravitons are treated 
on exactly the same basis as other particles such as photons and 
electrons. In particular, particles (including gravitons) are al- 
ways in flat Minkowski space and move as if they followed their 
geodesies in curved spacetime because of the dynamics of multi- 
ple graviton exchange. This particle physics approach is entirely 
equivalent to the usual geometric approach. Pure relativists of- 
ten become somewhat uneasy at this point because of the fol- 
lowing two aspects entirely peculiar to gravitation: 

• In canonical quantization one must decide before quanti- 
zation which points are spacelike separated and which are 
timelike separated, in order to define the basic commuta- 
tion relations. However, it is only after quantization that 
the fully quantized metric field can tell us this spacetime 
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structure. It follows that the concept of space-like or time- 
like separation has to be preserved under quantization, and 
it is not clear whether this is the case. (One might wonder 
whether the causal structure of spacetime need be the same 
in covariant quantization as in canonical quantization.) 

• Suppose one wanted to quantize the fluctuations (for exam- 
ple of a scalar field, or even of the gravitational field itself) 
about a given curved classical background instead of about 
flat Minkowski spacetime. In order to write the field opera- 
tors corresponding to these fluctuations in second-quantized 
form, one needs positive and negative frequency (annihila- 
tion and creation) solutions. In non-stationary spacetimes 
it is not clear whether one can define such solutions. (It 
may help to think of non-stationary space-time as giving 
rise to a time-dependent Hamiltonian.) 

The strategy of particle physicists has been to ignore these two 
problems for the time being, in the hope that they will ultimately 
be resolved in the final theory. Consequently we will not discuss 



them any further. [ SI | 



He raised the issue again in another work While quantization is not 
our immediate concern, a similar worry to the first of these two exists at 
the classical level if one wishes to take the flat metric seriously: there is 
no reason to expect that the dynamics will yield automatically a physical 
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causal structure consistent with the a priori special-relativistic one, but 
inconsistency leads to grave interpretive difficulties. As this history shows, 
workers in the covariant perturbation quantum gravity program gave most of 
their attention to technical issues, not conceptual questions (see also ( [|8^ ). 

2.3 The Years 1979-2001: the Problem Increasingly Attended 
and the Development of Three Views 

More recently, the question of null cone consistency has come to be recog- 
nized as interesting somewhat more often. While a fair number continue to 
neglect the issue, those who have addressed it can be found to have one of 
three attitudes toward the flat metric: that it is a useful fiction, that it is a 
useless fiction, or that it is the truth. These views will be considered in turn. 
First we note some recent signs of the growing awareness of the problem. 

In the 1984, the subject made its way into a standard text |148| ]. R. Wald 
writes: "The breakup of the metric into a background metric which is treated 
classically and a dynamical field ^ab, which is quantized, is unnatural from 
the viewpoint of classical general relativity. Furthermore, the perturbation 
theory one obtains from this approach will, in each order, satisfy causality 
conditions with respect to the background metric rjab rather than the true 
metric Qab- Although the summed series (if it were to converge) still could 
satisfy appropriate causality conditions, the covariant perturbation approach 
would provide a very awkward way of displaying the role of the spacetime 
metric in causal structure." []148| (p. 384). Once again, a prior commitment 
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to a geometrical understanding of classical gravity is evident. Some of Wald's 
negative attitude toward the "breakup" of gab results from assuming that 
the curved metric is fundamental, not derived. But given how easy it is 



to derive Einstein's equations from a flat spacetime theory ||17, 121|, why 
should one not regard the curved metric as derived? Be that as it may, one 
is pleased that the light cone issue is emerging from the neglect that it once 
suffered. It is intriguing that Wald suggests that the whole series might 
be (7ab-causal even though each term is rjab-causal. The most obvious way 
for such to occur would be for the curved metric's null cone in fact to be 
confined on or within the flat one's. If that is the case, then it seems that 
Wald is almost suggesting (albeit reluctantly) what we will do below. 

The recent contemplation of "naive quantum gravity" by S. Weinstein 
also has called attention to the lack of a fixed causal structure in quantum 
gravity |179(| . If one is interested in full quantum gravity, as opposed to 
semiclassical work, then "we would expect that the metric itself is subject 
to quantum fluctuations . . . But if the metric is [subject to quantum fluc- 
tuations], then it is by no means clear that it will be meaningful to talk 
about whether x and y are spacelike separated, unless the metric fluctu- 
ations somehow leave the causal (i.e. conformal) structure alone." (pp. 
96-7) There appear to be two things that this last suggestion might mean. 
First, it might mean that the metric is conformally flat, so that the causal 
structure is just that of flat spacetime, while gravity is described by a scalar 
field. However, it is well-known that scalar gravity is empirically falsified by 
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the classical tests of general relativity [17|, so this must not be Weinstein's 
intent. Second, it might mean that, although the full metric is allowed to 
vary, its variations are bounded so that the null cone of the nondynamical 
(and presumably flat) metric is respected. Correspondence between J. B. P. 
and Weinstein makes clear that this in fact was intended. That is what we 
propose here. Weinstein does indeed consider "whether it is at all possible to 
construe gravitation as a universal interaction that nonetheless propagates 
in flat, Minkowski spacetime." (p. 91) He concludes that 

the short answer is, 'No,' for three reasons. First, the 'invisibil- 
ity' of the flat spacetime means that there is no privileged way 
to decompose a given curved spacetime into a flat background 
and a curved perturbation about that background. Though this 
non-uniqueness is not particularly problematical for the classical 
theory, it is quite problematical for the quantum theory, because 
different ways of decomposing the geometry (and thus retriev- 
ing a flat background geometry) yield different quantum theo- 
ries. Second, not all topologies admit a flat metric, and therefore 
spacetimes formulated on such topologies do not admit a decom- 
position into flat metric and curved perturbation. Third, it is 
not clear a priori that, in seeking to make a decomposition into 
background and perturbations about the background, the back- 
ground should be flat. For example, why not use a background 
of constant curvature? (p. 92) 
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However, these arguments are less than compehing. Concerning the first ar- 
gument, Weinstein provides neither argument nor citation. It appears to be 
a claim that a suitably gauge-invariant theory cannot be constructed. But 
why should one believe that? Concerning the second objection, which re- 
sembles that of Ashtekar and Geroch, the advocate of flat spacetime will ask 
"why are nontrivial topologies necessary?" There are no facts or even good 



arguments that require them at present. [ 90 1 In the absence of such, the 
insistence that nontrivial topologies are theoretically necessary is close to 
question-begging. For why not merely adopt a nongeometrical view of grav- 
ity at the classical level, too?^ Concerning the last objection, it seems clear 
to us that a flat background is the default choice because it is simpler than 
any other choice. While any other choice requires some argument for making 
that choice instead of the others and strongly suggests the question "why 
does spacetime have this geometry?", flat spacetime does not, but rather is 
the obvious default choice. Weinstein's specific alternative suggestion of a 
constant curvature spacetime, for example, suggests the question "why does 
the curvature take this value, as opposed to some other value?" There is a 
one-parameter family of constant curvatures that one might specify, but flat 
spacetime, requiring no parameter, is simpler. We can agree with Weinstein 
that in "allowing metric fluctuations to affect causal structure, one is clearly 
at some remove from ordinary field-theoretic quantization schemes." (p. 97) 

But it seems unclear, pace Weinstein, that there is any need to renounce the 
^The issue of the topology of the universe has seen a fair amount of attention lately, 



with the aim of experimental test-see, for example, 180-L85|. 
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use of a flat background causal structure. 

It is unfortunate that some recent articles still do not address the null 
cone issue. However, enough have done so that one can identify three major 
attitudes toward the use of a flat metric that one finds. One view is that 
the flat metric is a useful fiction. Another, more purely geometrical view 
holds that the flat metric is a useless fiction. The third view regards the flat 
metric as the truth. We survey these approaches in this order. 

2.4 Field Formulation: the Flat Metric as a Useful Fiction 

Some authors have explicitly stated that the flat metric is merely an auxil- 
iary object, formally useful but not tied to the causal structure of the theory 
[^-^, 112| , The reasons given include the gauge- variance of the rela- 



tionship between the null cones and the unobservability of the flat metric. 
The first objection will be answered below by a new definition of a gauge 
transformation. These authors seem to have assumed that gauge transfor- 
mations should form a group, whereas we find a groupoid to be a natural 
substitute. The fact that the flat metric's null cone is sometimes violated 
using otherwise-convenient gauges appears to be another reason: it does not 
appear possible to fix the gauge to be, say, tensorial DeDonder and have the 
null cone relationship be automatically satisfactory. L. P. Grishchuk has 
written that "the mutual disposition of the light cones of the g^u and r]^i, 
can be of interest only in the case when the attempt is made to interpret 
the metric relations of the world as observable," which efforts are of course 
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bound to fail, he continues [Q. Unfortunately, Grishchuk has overlooked 
the same distinction that Gupta, Kraichnan, Thirring, Feynman, and prob- 
ably many others missed, and has failed to recognize that if the null cones 
can be made consistent, then a conceptual difficulty posed by quantization 
would be eliminated. 

A. N. Petrov describes the same view (though we have taken the liberty 
of spelling out with words the abbreviations used): 

However, the background in the field formulation of general rel- 
ativity is not observed. The movement of test particles and 
light rays is not connected with the geometry of the background 
spacetime. The light velocity in the background spacetime can 
approach an infinite value. In contrast, in the geometrical for- 
mulation of general relativity the test particles and the light rays 
define the geodesies in real physical spacetime. Thus, the back- 
ground spacetime in the field formulation of general relativity 
is an auxiliary and nonphysical (fictitious) concept . . . We stress 
that the field formulation of general relativity and the geometri- 
cal formulation of general relativity are two different formalisms 
for a description of the same physical reality and they lead to the 
same physical conclusions . . . there are no obstacles in treating 
any solution to general relativity (spacetime) in the framework 
of the field formulation of general relativity. However, it is clear 
that a manifold which supports a physical metric will not coin- 
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cide in general with a "manifold" which supports an auxiliary 
metric. As a result, in the field configuration on the auxiliary 
nonphysical background, "singularities," "membranes," "abso- 
lute voids," and others can appear. This leads to cumbersome 
and confused interpretations and explanations. Thus, the whole 
spirit of general relativity itself requires the investigation of many 
problems with the help of the geometrical formulation technique. 
However, there exists problems [sic] for an investigation in which 
the field formulation technique is more convenient |113| (pp. 452, 
453). 

Thus, the field formulation is seen as a tool that sometimes is helpful, but 
sometimes not so convenient, and in any case not to be trusted in addressing 
deep issues. 



A similar attitude has been taken by D. E. Burlankov |95|, who did 



some early work using a flat background metric as a convenient fiction |35]. 
Burlankov objects to the fundamental status of Minkowski spacetime be- 
cause of the gauge-variance of the null cone relation, and also because the 
curved null cone differs from the flat null cone |^^. The former argument 
will be addressed in due time. The latter argument, in Burlankov's hands, 
is said to imply that only curved metrics conformally related to the flat 
background would be acceptable. But this objection is just unpersuasive. 



36 



2.5 Geometrical Formulation: the Flat Metric as a Useless 
Fiction 

Other authors have taken the view that the flat metric is a blemish on the 
pure geometric beauty of general relativity, and thus is to be avoided. Such 



a description would seem to flt R. Penrose and J. Bicak [171|. This 
negative attitude toward the flat metric seems to have motivated Penrose to 
note that the null cone issue really must be handled if the Lorentz-covariant 
approach (which he associates with Weinberg) is to be considered satisfac- 
tory. Penrose, recognizing the connection between scattering theory and 
the Lorentz-covariant perturbation approach to gravity, poses a dilemma 
for the latter. Using global techniques, he shows that either the curved 
null cone locally violates the flat one, or the scattering properties become 
inconvenient because the geodesies for the two metrics continue to diverge 
even far away from a localized source. He concludes that a "satisfactory" 
relationship between the two null cones cannot be found. Concerning the 
horns of Penrose's dilemma, we simply accept the second one. It is known 
that long-range fields have inconvenient scattering properties [176|. We find 



that the root of the divergence between the geodesies is merely the long- 
range i character of the potential in the conformally invariant part of the 
curved metric. If the fall-off were a power law of the form e > 0, then 
no difficulty would arise. So this objection is basically a reflection of the 
fact that a long-range symmetric tensor potential exists. But why is that a 
fundamental problem? 
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2.6 Special Relativistic Approach: the Flat Metric as the 
Truth 

Besides the field-in-fictitious flat spacetime and geometrical approaches, 
there is another attitude that one might take toward the flat metric ap- 
proach, viz., that special relativity is correct in its usual strict sense (global 
Lorentz invariance, trivial spacetime topology, and no violation of ry-causality) , 
and thus that the gravitational field must be made to respect the flat metric's 
causal structure. This view is more conservative than the other views 
(p. 101), and is sufficiently obvious and attractive an idea that one might 
expect it to have been explored thoroughly, probably decades ago, and either 
sorted out or refuted. But that expectation would be disappointed. Demon- 
strating this surprising fact was one purpose of the substantial review of the 
history of the subject above. Some authors have claimed to have sorted it 
out, and some to have refuted it, but we disagree on both points. 

For the sake of convenience, this approach needs a name. We will use 
the term "special relativistic approach" (SRA). We have resisted calling this 
approach a "formulation" to match Petrov's "field formulation" and "geo- 
metric formulation," because it turns out that the SRA is in fact physically 
distinct, though in rather subtle and recondite ways, from the geometrical 
approach. The SRA takes a realist attitude toward the flat metric, whereas 
the field formulation takes an instrumentalist attitude. Some have objected 
to regarding a theory based on the Einstein equations as something other 



than general relativity |^-94|. Others, including J. Norton, have insisted 
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that the SRA is distinct from general relativity. Perhaps the common usage 
of the term "general relativity" is simply too vague to provide a resolution 
to this difficulty. If nothing else were at stake, one would avoid pretentious 
claims of a new theory. But as will appear below, there is in fact be a 
physical difference, which might even be testable in exotic circumstances, 
between the two approaches. The phenomena of collapse to form black holes 
seem to be altered somewhat in the SRA, not least because the SRA implies 
global hyperbolicity. 

Let us recall some common terminology regarding lengths and times 
given by the two metrics. Those measured using ordinary rods and clocks 
will be distorted by gravity and thus be governed by g^i, . These are "renor- 
malized" measurements. Lengths and times 'measured' using ideal rods and 
clocks, which are not distorted by gravity, will be governed by rji^^,. These 
are "unrenormalized" measurements. Let us also write ds^ for the renor- 
malized ^-interval and da^ for the unrenormalized ry-interval. These two 
types of lengths and times have different uses. Renormalized measurements 
will serve for all mundane purposes. Unrenormalized measurements, on the 
other hand, will be relevant in more mathematical or metaphysical contexts. 
For example, in deciding whether a solution of Einstein's equations exactly 
covers Minkowski spacetime, one must unrenormalized measurements. The 
solution should be neither too small for Minkowski spacetime, nor too large 
for it. Flat Robertson- Walker models with finite renormalized age appear 
too small, because naively it appears that the early end of Minkowski space- 
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time just lacks a curved metric. This problem can likely be resolved by 
stretching the Robertson- Walker solution so that the singularity occurs at 
unrenormalized past infinity. This move resembles one by C. Misner |194], 



A. G. Agnese and A. Wataghin |195], and J.-M. Levy-Leblond [196|, but 



our introduction of a flat background metric gives a compelling physical 
motivation, which they perhaps lacked. 

On the other hand, exact gravitational plane wave solutions appear to 
be too large for Minkowski spacetime. Whereas monochromatic linearized 
plane wave solutions in the Hilbert gauge give local causality violation, exact 
plane wave solutions can yield a global violation of causality due to lack of 
a Cauchy surface. These solutions have been called "caustic plane waves", 



and were first noted by Penrose [197|. Bondi and Pirani write, 



We call a plane gravitational wave caustic if it is capable of in- 
ducing accelerations of test particles so strong that two particles, 
initially at rest in flat spacetime and aligned suitably with re- 
spect to the wave, but arbitrarily far apart, will collide, within 
a finite time interval that is independent of their initial sepa- 
ration, after being struck by the wave. We shall show that all 
plane sandwich waves with fixed polarization are caustic, and de- 



scribe some unusual optical properties of such waves .... [P02 | 
(p. 395). 

We recall that a sandwich wave is one having a limited duration, so the 
wave is 'sandwiched' between two regions of flat spacetime. A special case 
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of this caustic phenomenon is found in W. Rindler's text ]203| ] (pp. 166- 
174). Such a phenomenon is obviously inconsistent with the SRA. In a set 
of coordinates that one rather naturally chooses, there exists a coordinate 
singularity in this solution |20S]. In 1937 N. Rosen concluded that plane 
waves in general relativity do not exist, because the metric becomes singular 



[poo , 201]. Later it was shown by I. Robinson and by H. Bondi that there 
exists a coordinate transformation that removes Rosen's singularity, so in 
the geometrical theory, such waves are in fact nonsingular and acceptable 
[ p.98| , 199]. We suspect that the plane wave solution can be fit into the SRA 
by truncating it at the coordinate singularity, which would then occur at 
unrenormalized future infinity. A mathematical treatment of these matters 
should appear in the future. 

The SRA has the advantage of simple and fixed notion of causality at 
the classical level, because the flat null cone serves as a bound on the curved 
one. In this view, one becomes less dependent on the study of topology, 
global techniques, careful definitions of causality of various sorts, and the 



like, such as modern texts contain [142, 148|] . The SRA is therefore simpler 
in an obvious way than the alternatives. There is no difficulty in extending 
this nondynamical causal structure into the quantum regime, so there should 
be no problem in writing down equal-time commutation relations, etc. in 
the usual way. Thus, worries expressed by Ashtekar and Geroch, Isham, van 
Nieuwenhuizen, Wald and Weinstein above are resolved. 

While the special relativistic approach to Einstein's equations is locally 
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and classically equivalent to the usual theory, as we saw earlier, there might 
be different global or quantum properties. For example, though flat space- 
time with trivial topology is stable in general relativity [187|, closed flat 



space is unstable [188], so it appears that the usual topology is more than 
just a simple and convenient choice for the SRA. Also, it likely will turn out 
that some regions of spacetime in complete exact solutions of the geometrical 
theory simply do not exist in the SRA, perhaps due to infinite postponement 
from the lapse's tending toward 0. Moreover, the SRA of general relativity 
has less gauge freedom than the geometrical and field formulations, because 
any gauge choice that leads to an improper null cone relationship must be 
prohibited. To be more precise, the SRA configurations form a proper sub- 
set of the naive configurations, but with the same order of infinite size. The 
use of a new set of variables, in which only the null-cone respecting field 
configurations are possible, would be a way to prohibit them. We propose 
such a set below. 

The attitude of regarding the flat spacetime as fundamental has been 



most visibly promoted by A. A. Logunov and colleagues [p8| - |100(| (to name 

a few). I To distinguish their view clearly from any geometrical notions, 

they have given the name "relativistic theory of gravitation" (RTG) to the 

work. The nature of the RTG has evolved slightly over the years. For some 

time it consisted in Rosen's tensorial FF action for general relativity and 
^This school has also produced an energetic critique of geometrical general relativity 
as lacking physical meaning, in the sense of lacking conservation laws and failing to make 
definite predictions. We do not endorse this critique. 
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his tensorial DeDonder condition postulated as necessary, presumably 
with specification of trivial topology for spacetime. There is also attached 
a "causality principle" that requires that the curved null cone not violate 
the flat one ||98, 172, 174|. This causality principle, which seems to have 
appeared following criticisms by Zel'dovich and Grishchuk |93,^|, is the 
feature most relevant to our purposes. More recently, the RTG has often 
featured acquired a mass term, but we are interested especially in the mass- 
less version. (The massive version will have the same null cone consistency 
problem for gravitational waves.) 

Logunov et al, being committed to the flat spacetime view, regard the 
question of compatible null cones as requiring a solution. Furthermore, they 
believe it to be solved already by their causality principle, which we shall call 
the Logunov Causality Principle. This principle states that field configura- 
tions that make the curved metric's null cone open wider than the flat met- 



ric's are physically meaningless |98, 173, 174]. As they observe, satisfaction 



is not guaranteed (even with their gauge conditions, notes Grishchuk |94|), 
which means that the set of partial differential equations is not enough to 
define the theory. The Logunov Causality Principle is therefore enforced "by 
hand." Some causality principle is indeed needed, but the Logunov Causal- 
ity Principle strikes us as somewhat arbitrary and ad hoc. One would desire 
three improvements. First, one would prefer that the causality principle be 
tied somehow to the Lagrangian density or field variables, not separately 
appended Second, one wants a guarantee that there exist enough solu- 
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tions obeying the principle to cover all physically relevant situations. Third, 
one would prefer a more convenient set of variables to describe the physics. 
We address all of these matters below. 



Concerning the first shortcoming, it might be suggested |100| that the 



Logunov causality principle is analogous to the energy conditions [148| that 
one typically imposes. However, this analogy strikes us as weak. The dis- 
similarity is in how the two conditions accept or reject solutions. The energy 
conditions are used to exclude or include whole classes of matter fields, so 
any configuration with one sort of matter field-perhaps a minimally cou- 
pled massless scalar field with the correct sign in the Lagrangian density- 
is permitted, whereas any configuration with another sort of matter field- 
perhaps the scalar field with the wrong sign- is prohibited as unphysical. 
This criterion expresses the idea that some sorts of matter are physically 
reasonable, but others are not. Furthermore, there is no worry that a per- 
missible sort of matter could evolve into a forbidden sort in accord with the 
field equations. On the other hand, the Logunov Causality Principle cannot 
give (or at least has not given) a similarly general explanation for why it 
rejects some solutions of the field equations. 

A more serious problem is that it cannot give any assurance that it per- 
mits a sufficiently large number of solutions to cover all physical situations 
that arise. A priori there is no reason to believe that one can (partially) fix 
the gauge, and then still reject some solutions in the appropriate gauge as 
unphysical. A posteriori there seems to be good evidence that this worry is 



44 



serious. Let us imagine a young man, Nicholas, playing on a drum set. Con- 
templating the motion of his arms and sticks, we may be confident that the 
traceless part of the second time derivative of his quadrupole moment, con- 
tracted with itself, is usually nonzero. But that means that Nicholas emits 
gravitational radiation, for this is just the formula for the average power 
radiated in general relativity, under suitable assumptions |14^ . There will 
be anisotropic but roughly spherical waves of gravitational radiation diverg- 
ing from Nicholas. Far away from him, these waves will look approximately 
like plane waves obeying linearized gravity with the tensorial Hilbert gauge 
condition. The behavior of plane monochromatic single-polarization waves 
in linearized general relativity is well-known |192[| . In this gauge, the two 
energy-carrying polarizations both consist of alternately shrinking one trans- 
verse direction and stretching the other, while the time (lapse) and spatial 
propagation directions are unaffected |192(| . (Below we will analyze this lin- 
ear solution using a generalized eigenvalue-eigenvector formalism.) But the 
shrinking of one spatial eigenvalue while leaving the time lapse unaltered 
implies a violation of the flat metric's null cone. A test particle passing 
through the wave could achieve superluminal speeds. In short, it appears 
that, if the exact behavior of the plane waves is anything like the linearized 
behavior, then monochromatic gravitational radiation satisfying the tenso- 
rial DeDonder condition generically violates the Logunov causality principle. 
While monochromatic radiation is a rather idealized case, and thus perhaps 
need not obey ?7-causality by itself for a satisfactory theory, it is not at all 
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obvious that the superposition of monochromatic waves which all individ- 
ually violate ?7-causality yields a sufficiently generous set of realistic waves 
that satisfy that condition. Thus, there is reason to worry that the Lo- 
gunov causality principle cannot be implemented, because Nicholas in fact 
can play drums. Evidently, arranging for wave solutions to obey the causal- 
ity principle is rather more difficult than addressing most of the solutions 
that Logunov and collaborators have addressed to date, which often have 
considerable symmetry and fairly trivial dynamics. |172-175[. Thus, some 



way of enforcing null cone consistency without excluding necessary solutions 
of the field equations must be sought. 

The lesson that we draw from the apparent shortcomings of the RTG in 
its present and past forms is not, pace some authors that the flat 



metric must be considered merely a useful fiction. Rather, if the SRA is to 
be maintained, then a more fundamental approach to securing consistency 
between the null cones must be sought. One will want to use the gauge 
freedom of general relativity to secure null cone consistency. In that way, one 
can be confident that a sufficiently large number of solutions exist, because 
one member from each equivalence class of solutions will be included. 

3 Describing and Enforcing the Proper Null Cone 
Relationship 
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3.1 Consistent Null Cones by Suitable Gauge Restrictions? 

As several authors above pointed out, the local relation between the two 



null cones is indeed gauge-dependent in general relativity [39, (Below 
we will circumvent this problem by redefining gauge transformations.) One 
might therefore hope to design a set of gauge-fixing conditions that yield 
the desired behavior, or at least to impose restrictions on the variables that 
exclude unsuitable gauge choices while permitting suitable ones. 

To this end, the work of M. Visser, B. Bassett, and S. Liberati on "su- 



per luminal censorship" is quite interesting [204, 205]. Working in linearized 
gravity in the usual DeDonder-Lorentz-Hilbert gauge, and excluding grav- 
itational radiation, these authors find that the curved metric's null cone 
opens wider than the flat background's only if the null energy condition 
(NEC) is violated. The NEC being rather commonly satisfied, this result 
is quite encouraging for efforts to respect the flat metric's null cone in full 
nonlinear general relativity. However, as they note, the NEC does not hold 
universally. They also note that this result does not obviously or easily 
generalize to strong field situations. Finally, we note that their exclusion 
of gravitational radiation is a severe limitation, for, as the study of gravi- 
tational waves in linearized general relativity shows, gravitational radiation 
in this gauge does result in widening of the null cone relative to that of 
the flat background. That is why we will renounce this gauge condition, 
so either a better gauge must be found, or (more likely) one must give up 
gauge-fixing. Rather than putting any gauge conditions into the theory by 
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hand, one would prefer to implement them in the action principle or the 
choice of field variables somehow. 

It might be hoped that the ADM split of the metric [147, 148 1, which is 
quite useful in applications and in identifying the true degrees of freedom, 
would be a good language for discussing the null cone consistency issue. Let 
us see if that is the case. For convenience we choose Cartesian coordinates, so 
that rjf^i, = diag{—l, 1, 1, 1). We therefore make an ADM split of Logunov's 
4-dimensional analysis of the causality principle. In considering whether all 
the vectors lying on r/'s null cone are (7-timelike, gf-null, or (7-spacelike, it 
suffices to consider future-pointing vectors with unit time component; thus 

= (1, y*), where V^V^ = 1 (the sum running from 1 to 3). The causality 
principle can be written hij{P' + V^){P^ + V^) - >0 for ah spatial unit 
vectors V^. Here the spatial metric is hij, the lapse is A^, and the shift is 

One could visualize this equation as the requirement that an ellipsoid 
(not centered at the origin if the shift /?* is nonzero) not protrude from the 
closed unit ball. Unfortunately, the "all" in "for all spatial unit vectors" is 
not too easy to handle, so we will in fact look for a better language than an 
ADM split for discussing null cone consistency. 

If there to be any is hope for restricting the gauge freedom so as to ensure 
that the curved null cone stays consistent with the flat one, then there 
must be "enough" gauge freedom to transform any physically significant 
solution into a form that satisfies ry-causality. It is (very roughly) the case 
that the curved spatial metric controls the width of the light cone, while 
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the shift vector determines its tilt from the vertical (future) direction and 
the lapse function determines its length. For general relativity, the spatial 
metric contains the physical degrees of freedom; the lapse and shift represent 
the gauge freedom, so they can be chosen largely arbitrarily. By analogy 
with conditions typically imposed in geometrical general relativity to avoid 
causality difficulties [|148| , one would prefer, if possible, that the curved 
light cone be strictly inside the flat light cone, not tangent to it, because 
tangency indicates that the field is on the verge of /^-causality violation. This 
requirement we call "stable ?7-causality," by analogy to the usual condition of 
stable causality |148|. One might worry that this requirement would exclude 
all curved metrics conformally related to the flat one, and even the presumed 
"vacuum" g^^^ = rj^y itself. Indeed it does, but if one takes the message of 
gauge invariance seriously, then there is no fundamental basis for preferring 
9iiy = over having the curved metric agree up to a gauge transformation 
with the flat metric. 

Let the desired relation between the null cones hold at some initial mo- 
ment. Also let the curved spatial metric and shift be such at some event in 
that moment that they tend to make the curved cone violate the flat one a 
bit later. By suitably reducing the lapse, one can lengthen the curved cone 
until it once again is safely inside the flat cone. By so choosing the lapse at 
all times and places, one should be able to satisfy the causality principle at 
every event, if no global difficulties arise. In a rough sense, one might use 
up ^ of the gauge freedom of general relativity, while leaving the remainder. 
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It is possible that this procedure forces the lapse toward in some cases, 
which implies that physical events are stretched out over more and more of 
Minkowski spacetime, perhaps to future or past infinity. (This procedure 
bears a formal resemblance to the use of singularity-avoiding coordinates in 



numerical relativity |193|, in which the gauge (coordinate) freedom is used 
to exile singularities to infinite coordinate values. The obvious difference is 
that here the trick is suitably invariant.) 

Frequently it is assumed that the reason that the gravitational Hilbert 
action is gauge-invariant is because such gauge invariance reflects a deep 
feature of the world, general covariance. However, as we saw above, one can 
give a somewhat humbler explanation: it is known from the flat spacetime 
approach that eliminating the time-space components of the field is essential 
for positive energy 

properties in Lorentz-invariant theories though in fact the time- 

time component need not be [|^. We might suggest that the gauging away of 
the time-time component is necessary rather to respect ry-causality. So gauge 
invariance appears to be required to respect positive energy and special 
relativistic causality. 

3.2 The Causality Principle and Loose Inequalities 

As should be clear from the worries about conformally flat curved met- 
rics, the desired relationship between the two null cones takes the form of 
some loose inequalities a < b. Such relations have been called "unilateral" 
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[ p06 -210 or "one-sided" |211-213|, typical examples being nonpenetration 



conditions. Such constraints are rather more difficult to handle than the 
standard "bilateral" or "two-sided" constraint equations that most treat- 
ments of constraints in physics discuss. Loose inequalities are also more 
difficult to handle than strict inequalities a < b, such as the positivity con- 



ditions in canonical general relativity |214, 215, 231-233|, which require that 



the "spatial" metric be spatial. One might eliminate the positivity condi- 
tions by a change of variables [215| that satisfies the inequalities identically, 



such as an exponential function h = , as Klotz contemplates. 

If one does leave causality constraints in the theory, rather than solving 
them as was suggested in the previous paragraph, then one must worry about 
impulsive constraint forces, unless one makes the constraint "ineffective", 



so the constraint force vanishes on account of the constraint itself |22C]. 
However, it seems considerably more satisfactory to reduce the configuration 
space of the theory so that the problem is avoided altogether. 

3.3 New Variables and the Segre Classification of the Curved 
Metric with Respect to the Flat 

Previous formulations of the causality principle, which have used the metric 



[P8|,p£, 173, 174 1 or ADM variables as above, have been sufficiently incon- 
venient to render progress difficult. This was our third complaint about 
Logunov's formulation of ?7-causality. One could achieve a slight savings 
by using the conformally invariant weight — ^ densitized part of the metric 
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9fiu{—g)~^ ■ Then nine numbers at each event are required (the determinant 
being —1), which is a bit better than the 10 of the full metric, but still too 
many. 

One would like to diagonalize and tj^i, simultaneously by solving the 
generalized eigenvalue problem 

= Ary^^F^ (1) 

or perhaps the related problem using g^v{—g)~^ ■ However, in general that 
is impossible, because there is not a complete set of eigenvectors, due to the 
indefinite (Lorentzian) nature of both tensors |221- p26| (and references in 



( p26| ). There are four Segre types for a real symmetric rank 2 tensor with 
respect to a Lorentzian metric, the several types having different numbers 
and sorts of eigenvectors [ p22 -225]. 



To our knowledge, the only previous work to consider a generalized eigen- 
vector decomposition of a curved Lorentzian metric with respect to a flat 
oneQ was done by L Goldman [227], in the context of Rosen's bimetric the- 



ory of gravity, which does not use Einstein's field equations. Goldman's 
work was tied essentially to Rosen's theory, so it does not address our con- 
cerns much. The lack of gauge freedom in Rosen's theory also ensured that 
the curved null cone was not subject to adjustment, unlike the situation 



* There is also a literature on choosing coordinates to diagonalize a curved metric |228| . 
According to K. P. Tod, "there is very little change for Lorentzian metrics" compared to 



Riemannian ones |228 . But there is a large change for the eigenvalue problem of interest 
to us in changing from a Riemannian to a Lorentzian background metric, so the connection 
between these problems must be somewhat loose. 
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in general relativity with a flat metric. As it happens, in Rosen's theory, 
for static spherically symmetric geometries, the causality principle is always 
violated^ so we conclude that the theory is not consistent with special rel- 
ativity. However, Einstein's theory evidently has enough gauge freedom to 
make a special relativistic approach possible. 

An eigenvalue-eigenvector decomposition for the spatial metric was briefly 



contemplated by Klotz and Goldberg [214, 215|| . For space, as opposed to 
spacetime, one has a positive definite background (identity) matrix, so the 
usual theorems apply. But Klotz and Goldberg, who did not assume a flat 
metric tensor to exist, found little use for the eigenvector decomposition 
because of the nontensorial nature of the 3x3 identity matrix. Such a de- 
composition, even given a flat metric tensor, is still somewhat complicated 
if the ADM shift is nonvanishing (g^i 7^ 0), as it usually is. Diagonalization 
has been quite useful in the study of spatially homogeneous cosmologies 
[pl9| , but our interest is not in speciflc solutions only, but the general case. 

Let us now proceed with the diagonalization project, conflning our atten- 
tion to four spacetime dimensions. (A brief assertion without proof of a few 



results from the eigenvector formalism appeared recently [130|.) Given that 
a complete set of generalized eigenvectors might fail to exist, it is necessary 
to consider how many eigenvectors do exist and under which conditions. 
This problem has been substantially addressed in a different context by G. 



S. Hall and collaborators |222-225|, who were interested in classifying the 

stress-energy or Ricci tensors with respect to the (curved) metric in (geo- 
^Prof. Goldman has kindly provided this information from his dissertation in Hebrew. 
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metrical) general relativity. Such problems have in fact been studied over 
quite a long period of time p26| (and references therein), but we find the 
work of Hall et al. to be especially convenient for our purposes. There 
exist four cases, corresponding to the four possible Segre types (apart from 
degeneracies) for the classified tensor. The case {1,111} has a complete 
set of eigenvectors (1 timelike, 3 spacelike with respect to rf), and is thus 
the most convenient case. The case {211} has two spacelike eigenvectors 
and one null eigenvector (with respect to ry), whereas the {31} case has one 
spacelike eigenvector and one null one. The last case, {zzll} has 2 spacelike 
eigenvectors with real eigenvalues and 2 complex eigenvalues. 

We now consider the conditions under which metrics of each of these 
Segre classes obey r^-causality. To give a preview of our results, we state 
that the {1, 111} and {211} cases sometimes do obey it, although the {211} 
metrics appear to be dispensable. But no metric of type {31} or {zzll} 
obeys the causality principle, so these types can be excluded from consider- 
ation for the SRA. 

Hall et al. introduce a real null tetrad of vectors L^,A^^,X^,y^ with 
vanishing inner products, apart from the relations rf^^jL^N^ = rj^j^^X^^X'^ = 
rjfj^yY^Y^ = 1, so and N^^ are null, while X^^ and are spacelike. 
(The signature is — h ++.) Expanding an arbitrary vector as = 
V^Lt^ + V^Nf^ + y^X^ + V^Y>^ and taking the ??-inner product with each 
vector of the null tetrad reveals that = ri^^V^'N", = r]^^V^'L'', 
= r]fj,^V^'X'', and = 'nf.^VY" . Thus, the Kronecker delta tensor 



54 



can be written as 5^ = L^Nv + L^N^ + X^^X^ + Y^^Yj,^ indices being lowered 
here using r]^y. For some purposes it is also convenient to define the timelike 
vector T'^ = ^^^^ and the spacelike vector = ^'^^'^ ■ 



We employ the results of Hall et al. [p22| -225|, who find that the four 
possible Segre types (ignoring degeneracies) for a (real) symmetric rank 2 
tensor in a four-dimensional spacetime with a Lorentzian metric can be 
written in the following ways, using a well-chosen null tetrad. The type 
{1,111} can be written as 

g^y = 2poi(^iVz.) + Pi{L^L^ + N^N^) + P2X^X^ + paY^Y^, (2) 

or equivalently 

+ P2X^X, + p^Y^Y,. (3) 

As usual, the parentheses around indices mean that the symmetric part 
should be taken [|148(| . The type {211} can be written as 



g^y = 2piL(^iV^) + XL^Ly + piX^X^ + p^^Y^Y^, (4) 
with A 7^ 0, the null eigenvector being L'^. The type {31} can be written as 

g^y = 2/9iL(^iV^) + 2L(^X^) + piX^X^ + p2Y^Y^, (5) 
the null eigenvector again being L^. The final type, {zzll}, can be written 



as 



g^,^ = 2poL(^^N^) + pi{L^L^ - %N^) + P2X^,X^ + psY^^Y^, (6) 
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with pi 7^ 0. The requirements to be imposed upon the curved metric for the 
moment are the following: all jy-null vectors must be y-null or y-spacelike, all 
ry-spacelike eigenvectors must be 5-spacelike, g^i^ must be Lorentzian (which 
amounts to having a negative determinant), and g/^i, must be connected 
to 7]^^ by a succession of small changes which respect ?7-causality and the 
Lorentzian signature. It convenient to employ a slightly redundant form 
that admits all four types in order to treat them simultaneously. Thus, we 
write 

g^, = 2^L(^iV,) + BL^L, + C%N, + DX^X, + EYi^Y, + 2FL^^X,y (7) 

Using this form for 5^,^, one readily finds the squared length of a vector 
to be 

+2FV^V^. (8) 

It is not clear a priori how to express sufficient conditions for the causal- 
ity principle in a convenient way. But it will turn out that the necessary 
conditions that we can readily impose are also sufficient. 

3.4 NecessEiry Conditions for Respecting the Flat Metric's 
Null Cone 

The causality principle requires that the ?7-null vectors L'* and iV* be ^-null 
or fl'-spacelike, so B > 0, C > 0. These conditions already exclude the type 
{z ^11}, because the form above requires that B and C differ in sign. It must 
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also be the case that the 77-spacehke vectors X'* and Y^^ are 5-spacehke, so 
L> > and S > 0. 

Not merely L'* and AT'*, but all r/-null vectors must be 5-null or g- 
spacelike. This requirement quickly implies that E > A, and also requires 
that 

B{V^f + 2FV^V^ + {D-A){V^f>0. (9) 

Here there are two cases to consider: F = 1 for type {31}, and F = for 
types {1,111} and {211}. Let us consider F = 1. The {31} has 5 = 0, 
so the equation reduces to 2FV^V^ + {D — A){V^)'^ > 0, which implies 
that either = or, failing that, 2FV^ + {D - A)V^ > 0. Clearly one 
could also consider a null vector with the opposite value of V-^ , yielding 
the inequality 2FV'^ — [D — A)V'^ > 0. Adding these two inequalities gives 
4V^ > 0, which simply cannot be made to hold for all values of . Thus, 
the F = 1 case yields no ry-causality-obeying curved metrics, and the {31} 
type is eliminated. It remains to consider F = for the {1, 111} and {211} 
types. The resulting inequality is B{V^f + {D - A){V^)'^ > 0. Because 
B >0 has already been imposed, it follows only that D > A. 

Let us summarize the results so far. The inequalities B > and C > 
have excluded the type {z zll}. We also have D > 0, D > A, E > 0, E > A. 
Finally, F = excludes the type {31}, so only {1, 111} and {211} remain. 

We now impose the requirement of Lorentzian signature. At a given 
event, one can find a coordinate x such that (^)^ = and (flipping the 
sign of if needed for the orientation) a coordinate y such that i-§^)^ = Y^; 
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these two coordinates can be regarded as Cartesian. Then the null vectors 
and A^^ lie in the t — z plane of this sort of Cartesian system. The curved 
metric has a block diagonal part in the x—y plane with positive determinant, 
so imposing a Lorentzian signature means ensuring a negative determinant 
for the 2 X 2 t — z part. The vectors L^^ and iV* in one of these coordinate 
systems take the form = (L°, 0, 0, L^) and = (iV°, 0, 0, N^). Given the 
Cartesian form rj^i^ = diag{—l, 1, 1, 1) and the nullity of these two vectors, 
it follows that |L°| = IL^I and |iV°| = |iV3|. Therefore the relevant parts of 
the curved metric can be written in such a coordinate basis as 

goo = 2AL^N^ + B{L°f + C(iV°)^ 
£,03 = £,30 = -A{N^L^ + L^N^) - BL^L^ - CN^N^, 

g3S = 2AL^N^ + B{L^f + C{N^f. (10) 

Taking the determinant using Mathematica and recalling that |L°| = 
and lA^'^l = |A^^|, one finds that the condition for a negative determinant is 
2(^2 - BC)\L^\'^\N^\'^{sign{L^L^N°N^) - 1) < 0. The linear independence 
of L'* and iV* implies that sign{L^ N^) = —1, so the determinant 
condition is — BC > 0. Because B and C are both nonnegative, — 
BC > implies that A ^ 0. But the requirement that the curved metric 
be smoothly deformable through a sequence of signature-preserving steps 
means that the curved metric's value of A cannot "jump" from one sign of 
A to another, but must agree with the flat metric's positive sign. It follows 
that A>0. 
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We now summarize the necessary conditions imposed: 

A>0, A^> BC, B> 0, 
C > 0, D>A, E>A, 

F = 0. (11) 

3.5 Sufficient Conditions for Respecting the Flat Metric's 
Null Cone 

Thus far, it is not clear whether these necessary conditions are sufficient. 
We now prove that they are. It is helpful to consider the two types, {1,111} 
and {211}, separately. 

For the type {1, 111}, the conditions on the coefficients A, B, etc. reduce 

to 

^ > 0, A> B, B>0, 

C = B, D>A, E>A. (12) 

For this form the following relations between variables hold: 

A = pQ, B = pi, D = p2, 

E = p3. (13) 

It follows that this type can be expressed as 

g^c. = -{A- B)T^T, + {A + B)Z^Z, + DX^X, + EY^Y,. (14) 

Writing the eigenvalues for T^, X^^, Y^^, and Z^^ as D^, Dl, D\, and Df, 
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respectively, one has 

D^ = A-B, Dl = A + B, Dl = D, 

Dl = E. (15) 

One sees that the inequahties imply that the eigenvalue for the timelike 
eigenvector T'* (briefly, the "timelike eigenvalue") is no larger than any of 
the spacelike eigenvalues: 

D^o<Dl, D^Q<Dl D^o<Dl (16) 

and that all the (generalized) eigenvalues are positive. Let us now see that 
these conditions are sufficient. Writing an arbitrary vector as = 
yTrj,^i yx^ij, _^ yryiJ- + Zi^, one sees that its 7?-lcngth (squared) is 
r]nuV^'V'' = -{V^f + (V^f + (V^f + (V^f. Clearly this length is never 
more positive than ^^g^^uVV = -(F^)2+gj(y^)2+g|(yy)2+g|(yZ-)2^ 

so the necessary conditions are indeed sufficient for type {1, 111}. 

For the type {211}, the conditions on the coefficients A,B, etc. reduce 

to 

^ > 0, B>0, C = 0, 

D>A, E>A, F = 0. (17) 

One can write the curved metric in terms of T^, Z^^, X'^, and V^, though 
T'^ and Z^ are not eigenvectors. One then has 

g^, = -{A - ^B)T^n + {A + \b)Z^Z, + BZ^^T,^ + DX^X, + EY^Y,.{1^) 
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Writing an arbitrary //-spacelike vector field as = GT^+HZ^+IXf^+ 
JY^", with + + > G^, one readily finds the form of g^.uV^'V". 
Employing the relevant inequalities and shuffling coefficients, one obtains 
the manifestly positive result gf.^VV^ = A{H^ + + - G^) + ^B{G - 
Hf + {D- A)P + {E-A)J^. This positivity result says that all ry-spacelike 
vectors are g-spacelike. Earlier the requirement that all r/-null vectors be g- 
null or g-spacelike was imposed. These two conditions together comprise the 
causality principle, so we have obtained sufficient conditions for the {211} 
type also. 

The {211} type, which has with one null and two spacelike eigenvectors, 
is a borderline case in which the curved metric's null cone is tangent to 
the flat metric's cone along a single direction [ p29| . Clearly such borderline 
cases of {211} metrics obeying the causality principle form in some sense a 
measure set of all causality principle-satisfying metrics. Given that they 
are so scarce, one might consider neglecting them. Furthermore, they are 
arbitrarily close to violating the causality principle. We recall the criterion 
of stable causality in geometrical general relativity |148(] (where the issue 
is closed timelike curves, without regard to any flat metric's null cone), 
which frowns upon metrics which satisfy causality, but would fail to do 
so if perturbed by an arbitrarily small amount. One could imagine that 
quantum fluctuations might push such a marginal metric over the edge, and 
thus prefers to exclude such metrics as unphysical. By analogy, one might 
impose stable /y-causality, which excludes curved metrics that are arbitrarily 
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close to violating the flat null cone's notion of causality, though we saw 
that such a condition would exclude conformally flat metrics, also. Perhaps 
a better reason for neglecting type {211} metrics is that they are both 
technically inconvenient and physically unnecessary. Because 77-causality- 
respccting {211} metrics are arbitrarily close to {1, 111} metrics, one could 
merely make a small gauge transformation to shrink the lapse a bit more and 
obtain a {1, 111} metric instead. Thus, every curved metric that respects 
ry-causality either is of type {1,111}, or is arbitrarily close to being of type 
{1, 111} and deformable thereto by a small gauge transformation reducing 
the lapse. 

It follows that there is no loss of generality in restricting the configura- 
tion space to type {1,111} curved metrics, for which the two metrics are 
simultaneously diagonalizable. As a result, there exists a close relationship 
between the traditional orthonormal tetrad formalism and this eigenvector 
decomposition. In particular, one can build a ^-orthonormal tetrad field 

simply by choosing the normalization of the eigenvectors. This choice 
fixes the local Lorentz freedom of the tetrad (except when eigenvalues are 
degenerate) in terms of the fiat metric tensor. 

Rewriting the generalized eigenvector equation for the case in which a 
complete set exists, one can write QfivC^j^ = ry^i/C^D^, with the four eigen- 
values being the elements of the diagonal matrix D^. It is sometimes 
convenient to raise or lower the indices of this matrix using the matrix 
r]AB = diag{— 1,1,1,1). The tetrad field has {e^} has inverse {/^}. We 
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recall the standard relations g^v = f^^VABfu ^'^d gf^u^'^e'^ = Vab- It is 
not difficult to show the how the tetrad lengths are related to the eigenval- 
ues: r/^i/e^e^ = and equivalently, V^'^ f/^fu = D"^^ . It follows that 
= r]^ae%D^^ , which says that a given leg of the cotetrad can be 
expressed solely in terms of the corresponding leg of the tetrad e^, through 
a stretching, an index lowering, and possibly a sign change, without refer- 
ence to the other legs. Simultaneous diagonalization implies that the tetrad 
vectors are orthogonal to each other with respect to both metrics. 

3.6 Linearized Plane Waves a Difficulty for the Logunov 
Causality Principle 

As was stated in connection with Nicholas's drumming, monochromatic 
plane gravitational waves satisfying the linearization of the Einstein equa- 
tions and the Hilbert (linearized DeDonder) gauge appear to violate rj- 
causality in general. We will now show that in more detail, using Ohanian 
and Ruffini |192| as our guide, while making use of the eigenvalue technol- 



ogy introduced above. These results will also hold approximately for the 
Maheshwari-Logunov massive theory for large frequencies and weak fields. 

Defining a trace-reversed potential 4'^'^ = j'^'^ — i^'H^^l (with 7^ = 7) 
and imposing the Hilbert gauge d^(j)^'^ = 0, one puts the linearized Einstein 
equations in the form d'^cj)'^'^ = 0. Because we desire plane wave solutions, let 
\(|)^^^ = He^'^ cos{kax" + 'ip), where e^'^ is a constant polarization tensor, a 
constant polarization vector, and if is a small number fixing the amplitude. 
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We let the waves travel in the z-direction, so k'^ = a;(l, 0,0,1). We also 
define the vectors = (0, 1, 0, 0) and €3 = (0, 0, 1, 0). 

The gauge condition implies that the polarization tensor is orthogonal to 
the propagation vector: e'^^k^ = 0, leaving six independent solutions. One 
can take the six independent polarization tensors (with both indices lowered 
using the flat metric) to be: 
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(24) 



The first two, which are transverse-transverse and traceless, are the phys- 
ical (energy-carrying) polarizations in the massless theory. One can show 
that they induce effective curved metrics of type {1111}, but, partly due to 
the oscillations of the cosine function, they violate ry-causality, as the be- 
havior of the eigenvalues shows. The vector = (1,1,0,0) is r?-null, but 
is ^-timelike during every other half-period of the cosine function. Thus, 
this physical polarization violates 77-causality. The gauge waves do not seem 
helpful, because they all violate 77-causality, too. 

3.7 Dynamics of the Causality Principle 

Above it was shown that the tensorial DeDonder gauge, which for many 
purposes is the best gauge choice possible, leads to causality violations. It 
is doubtful that any better gauge choice exists, as far as the null cones are 
concerned, unless one gives up the condition that gfj,i, — rj/^i^ for weak fields. 
But this condition is not gauge-invariant, so giving it up is not so difficult 
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after all. To achieve null cone consistency, we renounce gauge-fixing, pace 
many authors (including ourselves!) |^|, 115, 11§| , 121, 130[ . 



Imposing stable ?7-causality gives the strict inequalities 

dI>1,D^>1,dI>1. (25) 



But strict inequalities can be solved, as Klotz suggested [214, pl5 |, and then 



the causality constraints would be eliminated. This goal is achieved by 
setting, for example, 

f)i = e" + l,52^e^ + l,5| = e'T' + l. (26) 

One could use these new variables to satisfy the causality constraints auto- 
matically. 

3.8 Finite Gauge Transformations and Orthonormal Tetrads 

The form of a finite gauge transformation for the densitized inverse metric 
tensor in the field formulation is known from the work of Grishchuk, Petrov, 
and A. D. Popova |91| to have the form 

g-p ^ e^^Q-P, u ^ e^«n, r]^, ^ ij,,, (27) 

in terms of the convenient variable g^^ = \/—gg'^'', the flat metric tensor, and 
matter fields u described by some tensor densities (with indices suppressed). 
They made use of a first-order action. Using a second-order form of the 
action, we will now confirm that this transformation indeed changes the 
action merely be a boundary term. We will also derive convenient formulas 
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involving different sets of variables. In addition, we will introduce a formula 
for finite gauge transformations of tetrad fields. 

We recall the bimetric form of the action above for a generally covariant 
theory|121|, with the metric here expressed in terms of the weight 1 inverse 



metric: 



Clearly the terms other than Si change at most by a boundary term, so our 
attention turns to Si = / d'^xCi. The important term £i in the Lagrangian 
density is just the sum of terms which are products of Q^'^ , u, and their 
partial derivatives. 

We now derive a useful formula. Writing out e'^^A as a series e^^A = 
Si^o h'^\^ ^'-'^ some tensor density A will put us in a position to derive a 
useful 'product' rule for the exponential of Lie differentiation. One could 
write a similar series for another tensor density B. Multiplying the series 



and using the Cauchy product formula |239 



ttiZ' bjZ^ = S CLkbn-kZ^ (29) 
i=Q j=0 n=0 fc=0 



and the n-fold iterated Leibniz rule |239 



M'"'-E^/<'V"-'. (30) 

one recognizes the result as the series expansion of e^^{AB), so one has the 
pleasant result 

{e^^A){e^^B) = e^^{AB) (31) 
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Using the fact that partial differentiation commutes with Lie differen- 
tiation, ones sees that replacing q^''^ by e'^^Q^'^ and u with e^^u in Ci will 
give e'^«£i. Thus, the change in Ci is 5Ci = (c^^ — which is the 

Lie derivative of a scalar density of weig ht 1. Recalhng fml that the Lie 



derivative a weight 1 scalar density (j) is £^<t) = {^^"(1)),^, one sees that 5jOi is 
just a coordinate divergence, as desired. 

In view of the matrix relationships among the various metric quantities, 
one has by definition that (g^'^ + SgP''^){Qpi, + dQpu) = 6p and various other 
relations. In that way, one can derive the form of SQp,^, 5g, Sgpu, and the 
like. Let us show this fact explicitly for g, using q'^p + 6q'^p = e'^^Q''^ . 
The determinant is given by \q"P\ = [Q;/xz/p](5°(5^(5^5^g"^g'^^g'^^0'"'^, where 
[a^jLup] is the totally antisymmetric symbol with [0123] = 1. Because this 
form for the determinant holds in any coordinate system, [a/xz/pJJjjJ^J^J^ is 
a scalar (and also a constant), so e'^^{[aiiup\5^p5}^6'^5^) = [aiJLvp\5^p5}^5'^6^. 
We therefore have 

= e^^\Q''P\. (32) 

Using q"P = g"Py/^, one recalls that \%"P\ = \gap\, so 

g + 5g = e^^g. (33) 

The relation —g — 5g = {\/—g + SyJ—g)"^ defines 5y/—g, so one quickly also 
finds that 

^+5V^ = e^«V^, (34) 
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with which one readily finds the result for g'^P and so on. Again the trans- 
formed field is just the exponentiated Lie derivative of the original. One 
therefore can readily use variables other than the densitized inverse metric. 

Grishchuk, Petrov, and Popova have exhibited a straightforward and 
attractive relationship between finite gauge transformations (with the expo- 
nentiated Lie differentiation) and the tensor transformation law []109 , IIC]. 



Evidently the former is fundamental, the latter derived. One can define a 
vector field E," using the fact that under a gauge transformation, g^'^ changes 
in accord with the tensor transformation law, while the flat metric stays 
fixed. Let us follow them and define in terms of the finite coordinate 
transformation 

x"" = e^'^eij^x". (35) 

Then the tensor transformation law, which is easy to use, gives the finite 
gauge transformation formula, which is difficult to use. One can therefore 
hope to avoid using the latter at all, and work only with the tensor transfor- 
mation law, which will tend to be simpler and will relate to known results 
in the geometrical theory. 

It appears that finite gauge transformations for an orthornormal tetrad 
formalism have never been studied before, so let us do so. If one imposes no 
requirements on the tetrad other than that it be orthonormal, then the for- 
mula is nonunique in the local Lorentz transformation matrix. The desired 
form turns out to be 

eU^e!; = e^^(OSe^ (36) 
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The Lie differentiation in the first factor acts on everything to its right. F is 
a matrix field which, when an index is moved using r]A_B = diag{ — l, 1, 1, 1) or 
rj^^, is antisymmetric: = —r]AEF§r]^'~^ . It is not difficult to verify that 
the above formula preserves both the completeness relation to the inverse 
metric g^^" = e^rj'^^e^^ and the orthonormality relation gfj.^e'^e'^ = rjAB- Let 
us now verify the completeness relation by showing that this relation with 
the gauge-transformed tetrad yields the gauge-transformed curved metric, 
using (c^s A)(e"^^5) = e'^i{AB). One has by definition of a variation A 
induced by this tetrad transformation, 

= e^n(e^)Se^r?'^^(e^)ie^]. (37) 

Acting with e~'^« gives 

+ Ag^n = (e^)Se^r/^^(e^)ie^. (38) 

We shall use the near-antisymmetry of F: F^ = —rjEjF^r}^'^. One then 
has 



M ( „F\C AB ( jE , jpE , TpE rpj 



= )!4r/""(/^ + F^ + Fj^F^ + . . )e% (39) 

where the one factor is expanded as a series. Continuing by moving the 
Lorentz matrix into strategic locations gives 
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= e'cie^)^(Ij - Fj + fM ' ■ ■ (40) 

where the near-antisymmetry of F has been employed. Reverting to the 
exponential form gives 

= (41) 

leading to the expected conclusion g'^'^ + Ag'^'^ = g^^^ + 5g'^^ = e^^g'^^ . 
Thus, completeness holds, and the tetrad-induced variation A of the inverse 
curved metric agrees with the gauge transformation variation 5. By simi- 
lar maneuvers, one establishes the orthonormality relation for this tetrad 
variation: 

(e^«5^.)(e^^ + 5e>XWs + ^^l) = r?^^. (42) 
Finally, the inverse tetrad transforms as 

fi + Sf^ = e^^{e-^)^f^, (43) 
with a negative sign applied to F. 

3.9 Gauge Transformations Not a Group, But a Groupoid 

If one is not interested in taking ry-causality seriously, then any suitably 
smooth vector field, perhaps subject to some boundary conditions, will gen- 
erate a gauge transformation. This is the notion that Grishchuk, Petrov, and 
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Popova have employed, and that on occasion we have used above. However, 
in the SRA, respecting ?7-causahty-indeed, preferably stable 77-causality-is 
essential. This fact entails that only a subset of all vector fields generates 
gauge transformations in the SRA. 

Let us be more precise in defining gauge transformations in the SRA, 
requiring stable ry-causality instead. A gauge transformation in the SRA is 
a mathematical transformation generated by a vector field in the form 



but we now introduce the requirement that both the original and the trans- 
formed curved metrics respect stable ry-causality. It is evident that a vector 
field that generates a gauge transformation given one curved metric and a 
flat metric, might not generate a gauge transformation given another curved 
metric (and the same flat metric), because in the second case, the trans- 
formation might move the curved metric out of the ?7-causality-respecting 
configuration space, which is only a subset of the naive configuration space. 

It follows that one cannot identify gauge transformations with generating 
vector fields alone. Rather, one must also specify the curved metric prior 
to the transformation. For thoroughness, one can also use the fiat metric as 
a label, to ensure that the trivial coordinate freedom is not confused with 
the physically significant gauge freedom. Let us therefore write a gauge 
transformation as an ordered triple involving a vector field, a flat metric 
tensor fleld, and a curved metric tensor fleld: 




(44) 
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where both g^i, and e^g^i, satisfy stable causahty with respect to r]^i,. 
The former restriction limits the configuration space for the curved met- 
ric, whereas the latter restricts the vector field. (At this point we drop the 
indices for brevity.) The non-Abelian nature of these transformations makes 
it advisable to use not ^, but the operator e^^, to label the transformations, 
because then the noncommutativity of two transformations is manifest. 

One wants to compose two gauge transformations to get a third gauge 
transformation. At this point, the fact that a gauge transformation is not 
labelled merely by the vector field, but also by the curved and flat metrics, 
has important consequences. Clearly the two gauge transformations to be 
composed must have the second one start with the curved metric with which 
the first one stops. We also want the flat metrics to agree. Thus, the 'group' 
multiplication operation is defined only in certain cases, meaning the gauge 
transformations in the SRA do not form a group, despite the inheritance 
of the mathematical form of exponentiating the Lie differentiation operator 
from the field formulation's gauge transformation. Two gauge transforma- 
tions (e"^*, 772, 52) and {e-^i ,r]i, gi) can be composed to give a new gauge 
transformation {e^* ,r]2-,g2) o (c^^ , 51) only if g2 = e^^g\ and 772 = "Ui- 
The left inverse of (e'^f , ryi , 31 ) is {e'^-i,r]i, e^^gi), yielding {e^-i ,r]i,e^igi)o 
{e'^i ,r]i, gi) = {l,r]i,gi), an identity transformation. The right inverse is 
also {e-^-i,rii,e-^igi), yielding {e^i,rii,gi)o{e^-i,rii,e^igi) = (1, 7?i, e'^f 51), 
which is also an identity transformation. Gauge transformations in the SRA 
do not form a group, because multiplication between some elements simply 
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is not defined. While the lack of a group structure is perhaps unfamiliar, 
there is a known mathematical structure that precisely corresponds to what 
the physics of the SRA dictates. According to A. Ramsay, "[a] groupoid 
is, roughly speaking, a set with a not everywhere defined binary operation, 



which would be a group if the operation were defined everywhere." |235] 
(pp. 254, 255) One need not rest with informal descriptions, because one 
can easily show that SRA gauge transformations in fact satisfy the axioms 



required of a groupoid, as defined by P. Hahn \23t\ and J. Renault |237]. 



Though groupoids are increasingly coming to the attention of physicists, 
they are sufficiently obscure that we reproduce the definition of Hahn and 



Renault here for convenience. According to them |236, 237 1, a groupoid is a 
set G endowed with a product map (x, y) ^ xy : G"^ ^ G, where G^ is a 
subset of G X G called the set of composable [ordered] pairs, and an inverse 
map X ^ x^^ : G ^ G such that the following relations are satisfied: 



1. (x ^) ^ = X, 



2. if and {y,z) are elements of G^, then {xy,z) and {x,yz) are 
elements of G^ and {xy)z = x{yz), 

3. {x~^,x) G G^, and if (x, y) £ G^, then x~^{xy) = y, 

4. {x,x~^) £ G^, and if {z,x) G G^, then (zx)x~^ = z. 

One readily interprets this definition as implying that every SRA gauge 
transformation has an inverse, and that multiplication is associative when- 
ever it is defined. 
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As we saw above, authors such as Grishchuk |94| and A. N. Petrov |113] 



have denied that the flat metric's null cone can have any physical significance 
in part because the relation between the two metrics' null cones is gauge- 
variant. While this objection holds if one insists that gauge transformations 
must form a group, such insistence seems unwarranted. In accord with the 
demands of the SRA, we define gauge transformations such that they respect 
the causal structure of the flat metric, and find that gauge transformations 
form a groupoid. Thus this objection to ascribing physical significance to 
the fiat metric's null cone is removed. 

3.10 Canonical Quantization in the SRA 

The relevance of the special relativistic approach to Einstein's equations 
to canonical quantum gravity deserves some consideration. The primary 
patrons of the flat metric in the context of Einstein's equations have been the 
particle physicists in the context of the old covariant perturbation program of 
quantization. However, this program famously appears to be perturbatively 
nonrenormalizable, even with the addition of carefully chosen matter fields 



in the later supergravity era |243]. Therefore, the covariant perturbation 



program has largely been abandoned.^ 



^An exception is some recent work by G. Scharf and collaborators such as I. Schorn, N. 



Grillo, and M. Wellmann (for example, [123, 128 ). Their use of "causal" methods helps to 



achieve finite results. Another approach [ pSq] , based on an old suggestion by S. Weinberg, 
is that the theory might be nonperturbatively renormalizable. Recently O. Lauscher and 



M. Renter have argued that this situation likely is realized ||238 
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With the patrons of the flat metric having largely diverted their attention 
to strings, membranes, and the like, one might form the belief that the use 
of a flat background metric has nothing further to contribute to quantum 
gravity, and in particular, to canonical quantum gravity. Isham writes of 
the null cone issue in the covariant perturbation program: "This very non- 
trivial problem is one of the reasons why the canonical approach to quantum 
gravity has been so popular." [p43| (p. 12) And again, "One of the main 
aspirations of the canonical approach to quantum gravity has always been to 
build a formalism with no background spatial, or spacetime, metric." |243] 



(p. 18) The use of a flat background in canonical gravity indeed seems to 
be rather rare, apart from some work in the field formulation by Grishchuk 
and Petrov |110(| , which does not consider the flat metric's null cone. 

However, it would be a mistake to conclude that the canonical formalism 
is immune to similar worries, worries which a flat background's null cone 
structure could address. Isham continues: 

However, a causal problem arises here [in the canonical approach] 
too. For example, in the Wheeler-DeWitt approach, the config- 
uration variable of the system is the Riemannian metric qab{x) 
on a three-manifold S, and the canonical commutation relations 
invariably include the set 

[qab{x)Acd{x')]=^ (45) 

for all points x and x' in S. In normal canonical quantum field 
theory such a relation arises because S is a space-like subset of 
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spacetime, and hence the fields at x and xi should be simultane- 
ously measurable. But how can such a relation be justified in a 
theory that has no fixed causal structure? The problem is rarely 
mentioned but it means that, in this respect, the canonical ap- 
proach to quantum gravity is no better than the covariant one. 
It is another aspect of the 'problem of time' .... |243(| (p. 12) 

Evidently introducing a flat metric can help: 

The background metric r/ provides a fixed causal structure with 
the usual family of Lorentzian inertial frames. Thus, at this level, 
there is no problem of time. The causal structure also allows a 
notion of microcausality, thereby permitting a conventional type 
of relativistic quantum field theory ... It is clear that many of the 
prima facie issues discussed earlier are resolved in an approach 
of this type by virtue of its heavy use of background structure. 
H (p. 17) 

What then is the difficulty? 

However, many classical relativists object violently . . . , not least 
because the background causal structure cannot generally be 
identified with the physical one. Also, one is restricted to a 
specific background topology, and so a scheme of this type is 
not well adapted for addressing many of the most interesting 
questions in quantum gravity: black hole phenomena, quantum 
cosmology, phase changes etc. |243| ] (p. 17) 
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However, above we have presented a formalism which plausibly ensures en- 
sure that the physical causal structure is consistent with the background 
one by construction. Thus, this first objection is answered. The second 
objection is strong only if one already knows that gravitation is geometrical 
at the classical level. But such a view is hardly mandatory. We conclude 
that it would be interesting to investigate the canonical quantization of Ein- 
stein's equations within the special relativistic approach, because serious 
conceptual problems with standard approaches would evidently be resolved. 

4 Global Hyperbolicity, Black Hole Information 
Loss, and a Well Posed Initial Value Formula- 
tion 

With the requirement of r/-causality imp osed-per haps stable ?7-causality us- 
ing the causality variables-it follows that any "SRA spacetime" (i?^, rj^^^, g^y) 
is globally hyperbolic. How so? It follows from //-causality that the future 
domain of dependence of an //-spacelike slice is in fact the whole of B!^. But 
global hyperbolicity just is the possession of a Cauchy surface |148], so any 



r/-causal SRA spacetime {R ,r]^,y, gf^i^) is globally hyperbolic. Global hy- 



perbolicity resolves the Hawking black hole information loss paradox |19C]. 
Thus, there is no evolution from a pure state to a mixed state, and so no in- 
formation is lost. If global hyperbolicity solves the problem, then so does the 
SRA. Evidently the geometrical interpretation of Einstein's equations helps 



78 



to generate this paradox, so the SRA becomes somewhat more attractive on 
conceptual grounds. 

If the nature of black holes in the SRA is altered, one might wonder what 
becomes of the work on black hole entropy. As J. Oppenheim has shown 
recently |191| ] , the proportionality of black hole entropy to area does not 
depend on the existence of an event horizon, 

but merely occurs in the limit as a gravitating system approaches its 
gravitational radius. Inclusion of the gravitational field in thermodynamics 
yields a correction term that violates entropy extensivity; in the limit as the 
radius approaches the Schwarzschild radius, the entropy is proportional to 
area rather than volume. 

One could further ask whether the SRA has a well posed initial value 
formulation. The use of harmonic coordinates has been a common technique 



for answering this question in the geometrical formulation [145], in which 
the choice of harmonic coordinates constitutes a gauge-fixing. Given that 
the SRA has somewhat restricted gauge freedom, and that the tensorial 
DeDonder gauge condition (which makes the coordinate ^-harmonic when 
r/-Cartesian) has causal difficulties for plane wave solutions, one might fear 
that the proofs of a well posed initial value formulation fail for the SRA. 
However, such a fear is groundless, as we see if we resist the temptation to 
use ?7-Cartesian coordinates, for which we have no need. The SRA has both 
coordinate freedom and gauge freedom. The choice of (7-harmonic coordi- 
nates, if the flat metric is unfixed, is merely a coordinate choice, leaving the 
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gauge unfixed. The eigenvalues, which express the relation between the two 
null cones, are coordinate scalars. The naive gauge freedom has not been 
used at all, and thus is fully available for deforming the curved metric until 
it becomes consistent with the flat one by reducing the lapse. Therefore the 
traditional harmonic coordinate approach to demonstrating a well posed ini- 
tial value problem experiences no obstacles from the SRA. The SRA indeed 
has a well posed initial value formulation, and one need not even appeal to 
recent work that permits coordinate freedom [242| to show it. 



5 Conclusion 

We have aimed to take special relativity seriously, including its causal struc- 
ture, while viewing gravity as described by Einstein's field equations. In 
reviewing the history of the Lorentz-covariant approach, we found that the 
fundamental issue of causality has frequently been ignored or distorted, and 
evidently has never been handled correctly. Next we found an adequate 
kinematical description of the relation between the null cones, and found 
that all metrics obeying stable r/-causality, and almost all metrics obeying 
r/-causality, possess in effect an orthonormal tetrad of eigenvectors. Plau- 
sibly one can deforming any physically relevant solution into one in which 
the proper null cone relation obtains. Having done so, one can adopt a new 
set of variables which ensure that the proper relation holds automatically. 
Gauge transformations form not a group, but a groupoid. As a result of 
using the flat metric, the problem of deflning causality in quantum gravity 
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is solved. Furthermore, every SRA solution is globally hyperbolic, so the 
Hawking black hole information loss paradox does not to arise. 
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